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Abstract 

Formulating  physical  theories  using  real-valued  space  and  time  often  raises  the  difficult  issue 
of  clustered  variation:  states  that  change  their  value  infinitely  often  in  a  finite  interval,  or  infinite 
sequences  of  events  that  occur  in  a  finite  interval.  Some  physical  theories  permit  nonsensical 
models  unless  clustered  variation  is  prohibited;  others  force  clustered  variation  to  occur.  This 
paper  surveys  a  number  of  technical  issues  involved  with  clustered  variation,  without  attempting 
to  find  a  uniform  treatment  of  the  issue.  We  present  examples  of  theories  where  clustered 
variation  must  be  allowed  and  of  theories  where  it  must  be  prohibited.  We  discuss  the  meaning 
of  plans  containing  infinite  series  of  events.  We  present  constraints  on  real-valued  fluents  and 
spatial  fluents  that  guarantee  discrete  variation  of  important  associated  states.  We  show  that 
requiring  discrete  variation  may  be  inconsistent  with  allowing  infinitely  many  objects  in  the 
universe,  even  if  each  object  is  individually  well-behaved.  We  show  how  the  need  for  requiring 
discrete  variation  may  depend  subtly  on  the  particular  ontology  chosen  for  the  domain. 


1      Introduction 

Viewed  from  a  modern  perspective,  the  great  insight  underlying  the  paradoxes  of  Zeno  the  Eleatic 
is  that  the  commonsense  understanding  of  time  and  space  is  profoundly  confused  on  the  subject 
of  their  infinite  divisibility.  On  the  one  hand,  it  seems  absurd  to  suppose  that  two  instants  or  two 
points  can  be  distinct  and  yet  have  no  instant  or  point  between  them.  On  the  other  hand,  if  space 
and  time  are  dense,  then  the  arrow  trying  to  reach  its  goal  or  Achilles  trying  to  catch  up  with  the 
tortoise  must  execute  infinitely  many  movements,  which  does  not  seem  right  either. 

Twenty-five  hundred  years  of  subsequent  mathematics  and  philosophy  have  pretty  well  settled  the 
mathematical  and  physical  problems  here.  The  consensus  after  millenia  is  that,  for  most  practical 
purposes,  one  may  take  space  and  time  to  be  dense  and  complete  (i.e.  9?")  and  that  it  is  consistent 
to  say  both,  "Achiles  executes  the  action  'Go  to  the  current  position  of  the  tortoise'  infinitely  many 
times,"  and  "Achilles  catches  up  to  the  tortoise  in  finite  time."  This  theory  leads  to  paradoxes 
even  stranger  than  Zeno's:  the  expression  of  a  discontinuous  function  as  the  infinite  sum  of  con- 
tinuous functions;  space-filling  curves;  nowhere-differentiable  continuous  curves;  the  Banach-Tarski 
paradox;  and  so  on.  Some  of  these,  such  as  Fourier  series,  have  been  accommodated  within  the 
language  of  physics;  others,  such  as  the  Banach-Tarski  paradox,  are  characterized  as  mathematical 
constructs  without  physical  realization.   The  dividing  line  between  the  two  is  generally  left  to  the 


•This  research  has  been  supported  by  NSF  grant  #IRI-9001447.   Thanks  to  my  father  Philip  Davis  and  to  Bud 
Mishra  for  some  mathematical  pointers. 


physical  intuitions  of  the  human  understander.  These  intuitions  change  over  time,  of  course.  In  the 
nineteenth  century,  nowhere-differentiable  curves  were  mathematical  monstrosities;  today,  they  are 
an  important  element  of  our  descriptive  language  in  a  whole  range  of  applications. 

All  these  problems  come  up  again  when  we  try  to  characterize  commonsense  domains  for  au- 
tomated reasoners.  A  theory  of  a  domain  that  uses  real-valued  time  must  prescribe  which  of  the 
mathematically  possible  behaviors  can,  in  fact,  occur  within  the  domain.  Particularly  knotty  prob- 
lems are  associated  with  clustered  variation  or  chatter,  a  Boolean  fluent  that  change  its  truth-value 
infinitely  often  in  a  finite  time  interval,  or  an  infinite  sequences  of  events  that  occurs  within  a  finite 
time  interval.  For  example,  should  a  theory  allow  a  light  to  be  on  during  time  instants  corresponding 
to  rational  numbers  and  off  during  time  instants  corresponding  to  irrational  numbers?  Should  it  al- 
low a  light  to  be  on  during  the  interval  t  €  [0, 1/2),  off  during  t  E  [1/2, 3/4),  on  during  t  £  [3/4,  7/8), 
off  during  t  G  [7/8,  15/16)  . . .?  Similarly,  should  it  allow  the  execution  of  an  infinite  loop  in  which 
the  first  iteration  is  executed  in  1/2  second,  the  second  in  1/4  second,  the  third  in  1/8,  and  so  on?1 

A  number  of  researchers  (e.g.  [Hayes,  85],  [Fleck,  88]  [Kaufmann,  91]),  noting  that  such  clustering 
can  neither  be  perceived,  due  to  the  limited  resolution  of  human  perception,  nor  actually  occur, 
due  to  the  atomic  and  quantized  nature  of  very  small-scale  physics,  have  suggested  that  3?"  is  a 
poor  choice  as  a  model  for  space  and  time,  and  that  some  other  topology  be  found.  However,  the 
alternatives  that  have  been  suggested  seem  to  lead  to  constructs  as  peculiar  as  those  encountered 
in  3?"  (see,  for  example,  the  definition  of  a  "straight  line"  in  [Kaufmann,  91])  and  there  are  many 
basic  physical  phenomena,  such  as  rotation,  that  have  not  received  any  adequate  treatment  in  these 
theories.  (Fleck's  proposed  topology,  which  uses  3?"  as  a  substrate,  may  be  an  exception.  However,  it 
is  not  clear  how  her  approach  can  be  applied  to  physical  theories.)  Therefore,  it  seems  to  me  that  the 
suggestion  of  alternative  topologies  must  be  taken  as  conjectural  until  someone  can  use  them  in  fully 
developed  physical  theories.  I  do  not  think  that  the  two  motivations  mentioned  above  are  promising 
leads  toward  better  theories  of  space  and  time.  The  issue  of  limited  perceptual  resolution,  I  believe, 
is  best  treated  as  an  aspect  of  perception,  rather  than  as  an  fundamental  quality  of  space  and  time 
[Davis,  89a].  The  quantized  nature  of  small-scale  physics  involves  phenomena  that  are  much  further 
from  a  commonsense  understanding  than  the  difficulties  of  real-valued  space  and  time. 

An  alternative  policy  would  be  to  use  real-valued  space  and  time,  but  to  impose  well-behavedness 
conditions  on  physical  entities.  One  attempt  at  such  a  policy  was  axiom  9  of  [McDermott,  82] 
(proposed  by  me).  Let  us  say  that  a  Boolean  fluent  varies  discretely  if  it  nowhere  exhibits  clustered 
variation;  that  is,  if  it  changes  its  value  only  finitely  many  times  in  any  bounded  time  interval. 
(An  equivalent,  formally  simpler,  definition  is  given  in  Theorem  1  of  the  appendix.)  Axiom  9  of 
[McDermott,  82],  which  we  will  here  call  the  "axiom  of  discrete  variation",  posited  that  every  Boolean 
fluent  must  vary  discretely.3 

1  The  use  of  real- valued  space  and  time  also  raises  other  problems  of  small  scale  geometry,  such  as  "Do  solid  objects 
occupy  open  or  closed  point  sets?"  "Can  an  solid  object  occupy  a  two  dimensional  surface?  or  a  shape  that  joins 
itself  at  a  point?"  "Must  the  surface  of  an  object  be  everywhere  smooth?  or  almost  everywhere  smooth?"  These 
problems  are  also  very  hard,  and,  it  turns  out,  closely  connected  to  the  problems  of  clustered  variation  addressed 
here.  Different  from  either  of  these  is  the  phenomenon  of  stutter,  discussed  in  [Forbus,  85],  in  which  a  physical  system 
departs  from  dynamic  equilibrium  for  an  instant,  and  is  immediately  returns  to  the  equilibrium  state.  This  is  not 
mathematically  possible  for  continuous  functions;  rather,  it  constitutes  a  proof  by  reductio  that  the  system  must 
maintain  the  equilibrium  state. 

2It  may  be  observed  that  some  infinitary  paradoxes  have  been  of  great  importance  in  physics:  OhJber's  paradox 
that  the  night  sky  should  be  infinitely  bright;  the  classical  calculation  that  a  black  body  should  radiate  infinite  energy; 
the  infinite  self-energy  of  a  point  electron;  and  so  on.  However,  there  is  a  critical  difference  between  these  paradoxes 
and  those  of  clustered  variation;  namely,  that  the  physical  paradoxes  remain  problematic  even  if  the  quantities  are 
not  infinite  but  merely  very  large.  If  the  universe  is  large  enough,  then  the  sky  should  be  much  brighter  than  it  is, 
and  if  the  classical  account  of  thermal  radiation  is  anywhere  close  to  right  then  the  radiation  of  a  black  body  should 
be  much  larger  than  it  is,  and  if  an  electron  is  small  enough,  then  its  mass  would  have  to  be  much  greater  than  it 
is.  By  contrast,  clustered  variation  raises  no  problem  until  it  becomes  truly  infinite;  very  frequent  variation  is  not 
problematic. 

3  A  similar  but  weaker  rule  was  proposed  in  [Hamblin,  71].  [Galton,  87]  discusses  some  philosophical  treatments  of 


"The  method  of  'postulating'  what  we  want  has  many  advantages;  they  are  the  same  as  the 
advantages  of  theft  over  honest  toil."  [Russell,  19].  It  soon  became  clear  that  imposing  the  axiom  of 
discrete  variation  creates  two  serious  difficulties.  The  first  is  that  the  axiom  of  discrete  variation  can 
only  hold  if  certain  restrictions  are  placed  on  the  the  language  of  states,  the  behavior  of  real- valued 
fluents,  and  the  shapes,  motions,  and  placements  of  objects.  For  example,  "the  temperature  of  the 
tea  being  a  rational  number"  must  be  excluded  as  a  possible  state.  The  function  x(i)  —  <sin(l/i) 
must  be  excluded  as  a  possible  behavior,  if  the  condition  x  >  0  is  to  be  allowed  as  a  fluent.  These 
restrictions  were  discussed  in  [McDermott,  82],  but  not  in  adequate  depth. 

The  second  difficulty  is  that  a  number  of  physical  theories  actually  predict  the  occurrence  of 
clustered  variation.  These  theories  are  idealizations,  but  they  are  reasonable  and  useful  idealizations, 
that  can  only  be  avoided  at  substantial  cost.  The  best  known  of  these  (discussed  in  [Fleck,  88])  is 
the  bouncing  ball  (Figure  1).  The  simplest  and  most  straightforward  model  of  a  ball  bouncing  on 
the  ground  is  that  the  ball  is  a  rigid  object,  and  that  at  each  bounce,  the  ball  bounces  up  with  a 
speed  that  is  a  fixed  fraction  p  of  the  speed  with  which  it  hit.4  Since  the  time  between  bounces  is 
proportional  to  the  speed  leaving  the  ground,  it  follows  that  the  time  between  bounces  decreases  in 
a  geometric  series,  and  that  therefore  the  ball  attains  perfect  rest  in  a  finite  time,  having  carried  out 
infinitely  many  bounces.  (Note  that  this  is  different  from  a  standard  damped  harmonic  oscillator, 
in  which  the  frequency  is  constant,  and  the  system  therefore  in  principle  never  reaches  a  state  of 
absolute  rest.)  Admittedly,  this  is  an  idealization;  with  an  actual  ball,  beyond  a  certain  point, 
the  system  will  be  dominated  by  the  internal  vibrations  of  the  ball,  which  have  a  fixed  frequency. 
However,  to  take  this  into  account  would  require  modelling  the  ball  as  an  elastic  object,  which 
greatly  complicates  reasoning. 

It  is  important  to  note  that  this  prediction,  though  peculiar,  does  not  constitute  an  inconsistency 
in  the  theory  of  rigid  solid  objects.  The  position  and  the  velocity  of  the  bouncing  ball  approach  a 
unique  limit  as  the  bounces  go  to  zero,  so  its  state  after  the  infinite  series  of  bounces  is  well-defined. 
The  scenario  with  infinitely  many  bounces  does  not  contradict  any  of  the  laws  of  Newtonian  physics. 

A  common  reaction  to  this  example  is  to  propose  that  a  reasoning  system  should  treat  the  ball 
as  rigid  except  at  times  where  it  is  predicts  clustered  variation,  where  it  should  use  a  more  realistic 
model.  The  problem  is  that  in  a  general  reasoning  system,  the  prediction  of  clustered  variation  may 
be  subtly  hidden.  Consider,  for  example,  the  following  problem.  Given  that  a  ball  is  at  rest  on  a 
surface  at  time  i,  is  it  possible  that  the  ball  was  bouncing  any  time  before  <?  (We  assume  that  the 
ball  and  the  surface  are  isolated  from  other  interference.)  It  is  not  hard  to  imagine  (or,  for  that 
matter,  to  build)  a  program  that  constructs  the  following  chain  of  reasoning. 

Suppose  that  the  ball  was  bouncing  before  t.  Consider  the  last  bounce  B.  It  must  have 
attained  height  x  on  B,  and  therefore  hit  the  table  with  velocity  v  =  y/2gx.  But  then  it 
would  have  bounced  up  again  with  velocity  (iv,  which  contradicts  the  assumption  that 
B  was  the  last  bounce.  Therefore,  the  ball  has  always  been  at  rest  on  the  table. 

Note,  first,  that  there  is  nothing  obvious  anomalous  in  the  conclusion  to  trigger  the  suspicions 
of  the  program  that  something  is  wrong  —  nothing  that  would  force  the  withdrawal  of  a  default 
assumption,  for  example.  Note,  also,  that  the  assumption  of  discrete  variation  is  "hidden"  here  in 
the  innocent-looking  step  of  assuming  that  if  it  bounced,  then  there  was  a  last  bounce.  Neither  the 
program  nor  its  creator  need  ever  do  any  thinking  about  infinite  sequences  of  events.  Put  the  point 
another  way:  a  qualitative  simulator,  addressing  this  system,  might  well  generate  a  envisionment 
graph  in  which  there  was  no  transition  from  a  bouncing  state  to  a  rest  state  (Figure  2).  This  graph 
is  not  incorrect;  however,  it  would  be  a  mistake  to  conclude  from  it  that  a  ball  that  is  now  bouncing 
will  never  be  at  rest,  or  that  a  ball  that  is  now  at  rest  was  never  bouncing. 

clustered  variation. 

4 Its  velocity  may  be  posited  by  convention  to  be  the  limit  of  the  speed  from  previous  times.  [Davis,  88] 
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It  is  clear,  in  retrospect,  that  the  axiom  of  discrete  variation  was  substantially  overkill,  in  trying 
to  apply  to  all  fluents.  A  more  reasonable  approach  is  to  define  "discrete  variation"  as  an  interesting 
property,  and  then  to  posit  that  specific  key  fluents  have  this  property.  The  problem,  then,  is  to 
determine  which  fluents  can  and  should  be  so  constrained. 

Over  the  last  several  years,  I  have  worked  on  formal  characterizations  of  a  variety  of  physical 
theories  using  real- valued  space  and  time  (e.g.  [Davis,  88,  90,  91]).  Each  time,  the  issue  of  clustered 
variation  has  arisen  in  new  ways  and  created  new  and  peculiar  problems.  My  experience  has  been 
that  there  are  no  general  well-behavedness  conditions  that  can  be  imposed  across  the  board  on  all 
theories  of  this  kind.  Rather,  the  issue  must  be  thought  through  seperately  for  each  such  theory, 
and  for  each  fluent  in  the  theory.  The  kinds  of  well-behavedness  conditions  to  be  imposed  depend 
delicately  on  the  particular  range  of  physical  phenomena  considered;  the  particular  idealizations  of 
these  phenomena  that  are  adopted;  the  kinds  of  fluents  that  are  considered  as  first-class  entities; 
and  the  way  in  which  the  theory  addresses  other  issues  of  space  and  time. 

Accordingly,  this  paper  does  not  attempt  a  general  discussion  of  the  problem  of  clustered  varia- 
tion. Rather,  it  addresses  a  number  of  technical  issues  that  arise.  Section  2  discusses  further  why, 
aside  from  intuitive  discomfort,  one  should  want  to  exclude  clustered  variation  in  certain  theories. 
Section  3  presents  a  few  more  examples  of  natural  theories,  in  addition  to  the  bouncing  ball,  that 
generate  clustered  variation.  Section  4  discusses  infinite  sequences  of  events.  Section  5  presents 
conditions  on  the  types  of  states,  real-valued  fluents,  shapes,  and  motions  that  are  sufficient  to 
guarantee  discrete  variation.  Section  6  discusses  how  the  necessity  or  impact  of  requiring  discrete 
variation  may  differ  depending  on  how  the  world  is  carved  up  into  entities.  Further  mathematical 
developments  and  proofs  are  given  in  the  appendix. 

Brooks  [1991]  accuses  the  knowledge  representation  community  of  spending  their  efforts  on  formal 
anomalies  that  never  arise  within  in  actual  programs.  The  work  here  certainly  looks  like  that,  but  I 
believe  that  this  is  an  illusion.  I  do  not  claim  that  a  physical  reasoning  program  has  to  worry  about 
clustered  variation.  I  do  claim  that  a  physical  reasoning  program  whose  creator  has  not  worried 
about  clustered  variation  is  in  danger  of  generating  nonsense,  and  that  this  danger  increases  rapidly 
with  the  breadth  and  flexibility  of  the  program.  And  when  situated  automata  reach  a  point  of 
sophistication  where  they  take  into  account  more  than  what  is  immediately  in  front  of  their  nose, 
their  creators  will  also  have  to  confront  these  issues. 


2     Why  exclude  clustered  variation? 

The  original  introduction  of  the  axiom  of  discrete  variation  in  [McDermott,  82]  was  motivated  by 
the  following  example.  Suppose  we  wish  to  prove  the  following  theorem:  "If  day  is  always  followed 
by  night,  and  night  is  always  followed  by  day,  and  it  is  now  day,  then  it  will  always  be  either  day 
or  night."  It  is  easily  seen  that  this  theorem  cannot  be  proven  because  it  is  not  logically  necessary. 
The  following  scenario  is  perfectly  consistent:  Day  from  t=0  to  t=l/2;  then  night  from  t=l/2  to 
t=3/4;  then  day  until  t=7/8;  then  night  until  t=15/16  ...and,  starting  at  t=l,  a  state  that  was 
neither  day  nor  night.  In  such  a  case,  day  would  always  be  followed  by  night  and  night  by  day,  but 
eventually  it  would  be  neither  day  nor  night.  (Compare,  "If  each  bounce  of  the  ball  is  followed  by 
another  bounce,  then  the  ball  will  always  be  bouncing,"  which  is  not  necessarily  true.)  However, 
if  we  postulate  that  "DAY"  and  "NIGHT"  are  states  with  discrete  variation,  then  the  proof  goes 
through  directly. 

A  sceptic,  however,  could  argue  that  this  is  not  a  particularly  good  example  to  illustrate  the 
usefulness  of  requiring  discrete  variation.  Suppose  that  the  conclusion,  "It  is  always  day  or  night" 
does  not  follow  from  the  premise  "Day  is  followed  by  night  and  night  by  day,"  what  harm  is  there 
in  that  gap?  Why,  indeed,  should  one  want  to  derive  this  by  inference  at  all,  considering  that  the 


statement  "It  is  always  day  or  night,"  is  just  as  reasonable  an  axiom,  with  just  as  much  direct 
evidence,  as  "Day  is  followed  by  night,  and  night  by  day?" 

Let  us  therefore  modify  the  example.  Consider  a  finite  state  system  with  four  states  numbered 
0,  1,2,  3;  an  action  A,  which  adds  2  to  the  state,  mod  4;  and  an  action  B,  which  adds  1  to  the 
state,  mod  4  (Figure  3).  Now,  given  that  the  system  is  in  state  0  in  situation  SI  and  that  only 
action  A  occurs  between  SI  and  S2,  infer  that  at  S2  the  system  in  an  even-numbered  state.  This 
may  be  a  useful  thing  to  infer.  For  example,  if  you  are  playing  a  game  against  an  opponent  and 
you  wish  to  keep  the  game  state  out  of  odd-numbered  states,  it  is  useful  to  know  that  you  can  do 
this  indefinitely,  as  long  as  you  can  prevent  the  occurrence  of  any  actions  other  than  A.  Another 
example:  if  the  states  are  nodes  in  an  envisionment  graph  from  a  qualitative  physical  reasoner,  such 
as  QP  [Forbus,  85],  then  the  usual  interpretation  of  the  graph  includes  the  inference  that  the  system 
will  always  be  in  a  state  that  can  be  reached  on  a  path  through  the  graph. 

The  problem  is  that  the  result  does  not  follow,  if  clustered  variation  is  possible.  It  is  consistent 
with  the  theory  that,  if  action  A  occurs  once  over  the  first  half  of  the  interval,  and  then  a  second 
time  over  the  next  quarter,  and  a  third  time  over  the  next  eighth  . .  .then  at  time  S2,  after  infinitely 
many  iterations  of  A,  the  system  should  be  in  either  state  1  or  state  3.  And  this  is  not  due  to  a 
gap  in  the  logic;  this  really  can  happen  in  systems  where  clustered  variation  is  possible,  such  as  the 
bouncing  ball.  Any  single  transition  "collision  with  the  ground"  preserves  the  state  "ball  is  moving" 
but  the  infinite  sequence  of  these  transitions  does  not.5 

Clearly,  we  cannot  impose  an  axiom  of  the  form  "The  system  must  be  in  an  even-numbered 
state,"  analogous  to  the  axiom  "It  is  always  DAY  or  NIGHT;"  the  system  is  allowed  to  be  in  odd- 
numbered  states  at  times  before  SI  or  after  S2.  One  approach  would  be  to  add  a  frame  axiom,  "The 
system  cannot  change  from  the  parity  of  the  state  between  SI  and  S2  unless  an  action  of  type  B 
occurs."  But  such  an  approach  could  easily  lead  to  a  horrific  version  of  the  frame  problem.  One  can 
easily  construct  networks  in  which  each  different  subset  of  actions  generates  a  different  partitioning 
of  the  network  into  strongly  connected  components.  In  such  a  network,  one  would  need  a  different 
frame  axioms  for  each  such  set  of  actions;  2k  —  1  axioms  for  k  different  actions.  Since  all  these  axioms 
are  computable  from  the  network  description,  this  is  preposterously  inefficient.  The  only  reasonable 
approach,  therefore,  is  to  demand  that  the  states  of  the  network  vary  discretely,  in  which  case  the 
conclusion  follows  directly. 

A  second  example:  Suppose  we  posit  the  following  axioms: 

•  Given  that  window  W  is  in  good  repair  in  situation  51,  it  will  be  broken  in  situation  52  if  and 
only  if  some  object  has  hit  it  in  the  interval  (51,52]. 


• 


Object  O  is  responsible  for  breaking  window  W  if  and  only  if  there  is  a  situation  5  such  that 
W  is  in  good  repair  up  to  5  and  O  hits  W  in  5. 


We  would  like  to  infer  the  rule  that,  if  W  goes  from  being  in  good  repair  to  being  broken,  then 
some  object  is  responsible  for  breaking  it.  Again,  this  is  not  a  valid  inference.  Consider  the  scenario 
where  the  window  is  hit  by  some  object  at  time  t  =  1;  before  that,  it  had  been  hit  at  time  t  —  1/2; 
before  that,  at  time  i  =  1/4;  at  time  t  =  1/8  . .  .Then  certainly  the  window  is  broken  at  any  time 
t  >  0.  However,  since  each  hit  occurs  after  it  is  already  broken,  no  object  is  responsible  for  breaking 
it.  (This  might  come  up  in  a  question  of  legal  responsibility.  Obviously,  a  person  cannot  be  sued  for 
breaking  a  window  that's  already  broken.) 


5This  inference  may  also  fail  in  envisionment  graphs  derived  from  theories  with  quantities  of  infinitely  different 
orders  of  magnitude.  ([Davis,  89b],  p.  429) 
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3      Other  Examples  of  Clustered  Variation 

If  we  adopt  standard  Physics  I  idealizations,  such  as  massless  strings  of  zero  radius,  point  masses, 
and  the  like,  then  it  is  easy  to  generate  scenarios  that  show  clustered  variation.  However,  such 
models  run  into  all  kinds  of  problems  and  inconsistencies;  clustered  variation  is  the  least  of  their 
difficulties.  Within  more  reasonable  theories  of  mundane  domains,  I  have  found  three  types  of 
examples  of  clustered  variation:  bouncing  rigid  solid  objects,  idealized  reflected  light,  and  oscillators 
with  a  varying  control  parameter. 

Bouncing:  Our  original  example  above  of  clustered  variation  was  a  ball  bouncing  to  heights 
that  reduce  in  geometric  progression.  Other  more  complex  examples  can  be  contrived:  (1)  The 
restoring  force  need  not  be  gravity;  a  ball  bouncing  off  a  surface,  and  returned  by  an  electric  field 
or  a  spring  will  (in  principle)  behave  the  same  way.  (2)  A  ball  falling  through  a  smoothly  curved 
funnel  whose  mouth  is  just  as  large  as  the  ball  and  has  a  vertical  tangent  will  have  infinitely  many 
bounces  (Figure  4).  (3)  If  the  initial  velocity  and  the  coefficient  of  restitution  are  small  enough,  a 
ball  bouncing  between  two  heavy  objects  will  bounce  infinitely  often  as  the  two  objects  converge  to 
a  distance  exactly  equal  to  the  diameter  of  the  ball  (Figure  5).  The  detailed  analysis  of  (2)  and  (3) 
is  given  in  appendix  A. 

Reflected  light:  Consider  a  rectangular  box  of  dimensions  p  x  q  whose  internal  sides  are 
mirrors.  A  ray  of  light  is  released  from  one  corner  of  the  box  with  angle  0.  The  points  where 
successive  reflections  of  the  ray  will  hit  the  box  may  be  easily  calculated  by  continuing  the  ray  in 
a  straight  line,  and  considering  where  it  intersects  virtual  reflections  of  the  box  (Figure  6).  The 
coordinate  ck  where  the  ray  hits  the  left  wall  for  the  ifcth  time  can  thus  be  calculated  as  follows:  Let 
lk  =  2kpla.n0.  If  [lk/q\  »s  odd,  then  c*  =  -Ik  mod  2q\  else  ck  =  h  mod  2q.  However,  if  pta.n6/q  is 
irrational,  then  no  two  values  c;  and  Cj  are  equal;  hence  the  points  c*  are  densely  scattered  along 
the  line. 

If  we  modify  the  model  to  use  a  beam  of  limited  width  and  uniform  intensity  and  imperfect 
mirrors  that  reduce  intensity  by  a  fixed  fraction,  then  the  whole  internal  surface  will  be  lit  up  after 
finitely  many  reflections;  however,  the  intensity  of  illumination  will  be  a  fractal  function  of  position. 
The  same  will  hold  if  we  further  generalize  the  model  to  allow  a  limited  angular  spread  and  a 
non-uniform  intensity,  as  long  as  the  intensity  cuts  off  discontinuously  at  the  boundaries  of  beam. 

This  clustered  variation  is  spatial  rather  than  temporal,  but,  as  we  shall  see  in  section  5.4, 
the  two  are  closely  related.  The  idealization  breaks  down,  of  course,  when  we  consider  the  finite 
wavelength  of  light,  the  quantum  nature  of  light  intensity,  or  the  finite  speed  of  light,  but  in  ordinary 
applications,  it  is  convenient  to  idealize  away  all  of  these. 

Variable  frequency  oscillation:  Most  physical  oscillators  behave,  at  least  around  their  equi- 
librium point,  like  damped  harmonic  oscillators;  that  is,  they  oscillate  with  fixed  period,  and  ex- 
ponentially decreasing  amplitude.  Such  a  system  does  not  exhibit  clustered  variation,  since  its 
zero-crossings  are  evenly  spaced;  in  principle,  it  will  cross  zero  infinitely  many  times,  but  only  over 
infinite  time.  However,  if  the  period  depends  on  an  external  parameter,  and  can  be  made  to  go  to 
zero  by  adjusting  the  parameter,  then  clustered  variation  may  be  achievable.  For  example,  if  you 
run  your  finger  along  a  vibrating  violin  string,  then  the  frequency  is  inversely  proportional  to  the 
distance  from  the  contact  point  to  the  bridge.  In  principle,  if  you  move  your  finger  toward  the  bridge 
in  such  a  way  that  the  distance  from  your  finger  to  the  bridge  is  proportional  to  (tfO  —  t)A  then  the 
string  crosses  the  center  infinitely  often.  (Moving  your  finger  at  constant  speed  actually  does  not 
give  clustered  variation.)  Other  similar  examples  abound:  a  pendulum  whose  length  goes  to  zero 
in  finite  time  with  the  proper  time-dependence,  an  LRC  circuit  where  the  capacitance  goes  to  zero, 
and  so  on. 

The  example  of  the  vibrating  string  is  slightly  peculiar,  in  that  any  single  point  on  the  string 
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crosses  the  center  only  finitely  many  times.  We  will  discuss  this  further  in  section  7.  We  can  remove 
this  anomaly  by  supposing  that  you  are  moving  two  fingers  together  on  the  string.  Then  the  string 
at  the  meeting  point  crosses  the  center  infinitely  often. 


4      Infinite  sequences  of  events 

In  this  section,  we  consider  the  analogue  of  clustered  variation  for  events  and  actions;  should  it  be 
considered  possible  for  infinitely  many  events  to  occur  without  overlapping  during  a  finite  interval? 

Certainly,  if  we  admit  event  types  such  as  "O  moves  from  PI  to  P2,"  or  "O  moves  distance  X 
in  direction  D,"  then  Zeno's  paradoxes  show  that  the  occurrence  of  infinite  sequences  of  events  is 
unavoidable.  The  motion  of  the  arrow  from  0  to  1  entails  the  occurrence  of  the  infinite  sequence, 
"Arrow  moves  from  0  to  1/2;  arrow  moves  from  1/2  to  3/4;  ..."  Though  this  looks  a  little  odd,  it 
is  presumably  not  problematic;  it  is  just  a  clumsy  way  of  speaking  about  a  simple  reality.  As  an 
analogy,  we  may  observe  that,  if  the  sequence  of  discrete  events  "sequence^,  B,  C,  D,  E)"  occurs 
then  concurrent  with  occurrence  of  event  C  are  the  occurrences  of  events  such  as  "sequence(S,  C)," 
"sequence(C,  D)" ,  "sequence(,4,  B,  C)"  ...The  fact  that  nine  such  events  are  occurring  simultane- 
ously is  not  a  problem,  just  a  clumsy  description. 

The  problems  start  to  look  more  troublesome  when  we  switch  from  descriptive  to  imperative 
mode,  and  talk  about  plans  rather  than  event  descriptions.  Do  we  want  to  say  that  Achilles  can 
carry  out  the  plan, 

procedure  PI 

while  (place(me)  ^  1)  move-to((l  +  place(me))/2) 

starting  at  place=0  by  running  forward  to  1?  If  so,  what  about  the  plan 

procedure  P2 

while  true  move-to((l  +  place(me))/2) 


or 


procedure  P3 
begin  D  :=  1; 

while  (place(me)  ^  1) 
begin  D  :=  D/2; 

goJbrward(D) 
end  end 

If  such  iterative  plans  are  OK,  why  not  recursive  plans  like 

procedure  gol(D) 
begin  goJbrward(£)/2); 

gol(D/2) 
end 

procedure  go2(£>) 

begin  go2(£>/2); 

goJbrward(D/2) 


procedure  go3(D) 
begin  go3(D/3); 

go_forward(  D/3 ) ; 

go3(D/3) 
end 

Obviously,  a  plan  interpreter  implemented  along  the  lines  of  a  conventional  programming  lan- 
guage interpreter  will  bomb  with  each  of  these;  they  will  either  go  into  an  infinite  loop  that  takes 
infinite  time,  blow  the  stack,  or  run  into  underflow.  However,  if  plans  are  viewed  as  descriptions  of 
actions  to  be  reasoned  about  rather  than  as  programs  to  be  blindly  executed,  then  there  is  a  good 
case  to  be  made  that  these  plans  should  be  admitted  just  as  we  admitted  the  event  descriptions 
discussed  above. 

It  is,  in  fact,  quite  easy  to  define  the  semantics  of  planning  languages  in  a  natural  way  that 
admits  these  plans  as  correct.  For  example,  most  semantics  for  plans  with  conditionals  and  loops 
(e.g.  [McDermott,  82]  [Manna  and  Waldinger,  86])  make  the  assumption  that  the  evaluation  of 
the  test  is  instantaneous,  and  that  the  body  of  the  conditional  begins  simultaneously  with  the 
conditional.  If  so,  and  we  idealize  real  arithmetic  as  perfect,  then  PI  is  valid.  The  validity  of  P2, 
under  the  same  assumption,  depends  delicately  on  exactly  how  the  termination  conditions  for  a  loop 
are  defined.  The  assignment  statements  in  P3  need  a  little  more  care.  Clearly,  the  execution  of  an 
assignment  statement  cannot  be  instantaneous,  since  that  would  leave  the  order  of  execution  of  a 
sequence  of  assignments  badly  undefined.  However,  if  no  minimum  cycle  time  is  set  for  the  execution 
of  an  assignment  —  for  example,  if  it  is  consistent  with  the  semantics  that  the  assignment  should 
take  a  nanosecond  on  the  first  iteration,  half  a  nanosecond  on  the  second,  a  quarter  on  the  third, 
and  so  on  —  then  there  are  executions  of  P2  that  terminate  in  finite  time.  As  for  the  recursive 
programs  gol,  go2,  go3,  by  using  an  appropriate  minimal  fixed-point  definition  of  recursion,  we  can 
make  these  valid  as  well.  (In  effect,  we  define  the  execution  of  any  infinitely  deep  calling  sequence 
as  completed  instantaneously.) 

Why  should  we  care  what  the  semantics  does,  given  that  no  plan  interpreter  can  execute  these 
plans,  and  no  planner  will  ever  come  up  with  them?  Actually,  these  assumptions  may  not  be  true.  At 
the  interpretive  side,  it  is  not  hard  to  imagine  a  plan  optimizer  that  could  recognize  one  or  another 
of  these  forms  and  compile  it  into  the  instruction  to  move  forward.  (Compare  "tail-recursion" 
optimization,  which  allows  an  infinite  loop  for  an  interactive  program  to  be  written  recursively, 
without  ever  blowing  the  stack.)  At  the  planning  side,  it  is  not  as  obvious  as  it  looks  at  first  glance 
that  no  planner  will  ever  come  up  with  these.  In  fact,  if  you  consider  Achilles'  problem  of  overtaking 
the  tortoise  and  apply  a  G PS-like  stategy,  then  the  infinite  loop 

while  (place(me)  ^  place( tortoise))  move-to(place( tortoise)) 

is  exactly  what  you  generate.  In  fact,  if  the  motion  of  the  tortoise  is  not  known  in  advance  and 
cannot  be  extrapolated,  all  plans  for  overtaking  the  tortoise  will  involve  something  like  this.  Achilles 
can  overtake  a  slowly  moving  butterfly,  but  he  must  be  prepared  to  make  finer  and  finer  adjustments 
as  he  gets  closer  and  closer.  (It  should  be  noted  that  his  motion  will  not,  in  general,  observe  the 
condition  of  piecewise  analyticity  discussed  in  section  5.)  Of  course,  this  is  only  a  truly  infinite  loop 
if  one  assumes  infinitely  precise  perception  and  motion,  but  these  are  natural  idealizations  until  one 
gets  down  to  very  nitty-gritty  levels  of  robotic  detail. 

Either  way  we  go  on  this  choice  makes  substantial  trouble  in  reasoning  about  the  plans.  If  we 
rule  out  infinite  loops  by  requiring  (realistically)  that  entering  each  control  structure  requires  a  cycle 
of  time,  then  this  complicates  plan  verification  because  it  becomes  necessary  to  demonstrate  that 
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the  world  does  not  change  too  much  while  the  control  structures  are  being  interpreted.  If  we  admit 
infinite  loops,  then  we  have  to  reason  about  the  effect  of  an  infinite  loop,  which  gets  back  to  the 
problems  mentioned  in  section  2.  Moreover,  rules  such  as  "A  recursive  program  with  no  base  case 
is  invalid,"  is  very  useful,  and  it  would  be  a  pity  to  give  it  up. 

5      Consequences  of  discrete  variation  for  Fluents 

In  this  section,  we  consider  how  requiring  discrete  variation  affects  the  range  of  states  and  fluents 
that  can  be  considered.  We  consider  Boolean  combinations  of  states  (section  5.1);  fluents  onto  finite 
ranges  (5.2);  real-valued  fluents  (5.3);  and  spatial  fluents  (5.4).  We  also  show  that  guaranteeing 
discrete  variation  also  often  involves  restricting  the  local  area,  or  the  universe  as  a  whole,  to  contain 
only  finitely  many  objects. 

5.1  Boolean  combinations  of  states 

It  is  easily  shown  that,  if  A\  and  A2  both  vary  discretely  then  the  conjunction  of  Al  and  A2,  the 
disjunction  of  .41  and  A2  and  the  negation  of  A\  all  vary  discretely.  Thus,  any  Boolean  combination 
of  finitely  many  such  states  varies  discretely.  For  future  reference,  we  give  this  fact  a  number: 

Theorem  1:   If  A\  . .  .Ak  each  has  discrete  variation,  then  any  Boolean  combination  of  them 
also  has  discrete  variation. 

The  Boolean  combination  of  infinitely  many  states  may  have  clustered  variation  even  if  each 
single  state  varies  discretely. 

5.2  Fluents  with  Finite  Range 

Let  F  be  a  fluent  with  a  finite  range  D.  The  state  F(t)  =  c  varies  discretely  for  every  constant 
c  G  D  if  and  only  if  F  changes  values  only  finitely  many  times  in  each  finite  interval. 

Let  Fl  . .  .Fk  be  fluents  with  finite  range  and  discrete  variation;  let  O  be  a  k-place  function,  and 
let  P  be  a  k-place  relation.  Then  the  fluent  X(t)0{Fl(t) . . .  Fk(t))  and  the  state  X(t)P(Fl(t) . . .  Fk{t)) 
varies  discretely.  Recursively,  any  fluent  or  state  define  as  a  ground  term  over  Fl  . .  .Fk  varies  dis- 
cretely. 

5.3  Real- Valued  Fluents 

When  we  move  to  real-valued  fluents,  things  get  trickier.  The  requirement  that  F(t)  change  its  value 

only  finitely  many  times  in  a  finite  interval  is  much  too  strong;  it  would  rule  out  any  function  that 

changed  continuously,  such  as  the  identity  function  F{t)  =  t.  On  the  other  hand,  the  requirement 

that  the  state  F(t)  =  c  have  discrete  variation  for  each  constant  c  is  much  too  weak.  The  bizarre 

function 

„.  .  _  J   t        if  t  is  rational 

\   —t     if  t  is  irrational 

has  discrete  variation  for  each  state  F(t)  =  c,  but  has  clustered  variation  for  the  states  F(t)  >  0 
and  F(t)  <  0. 

We  can  rule  out  such  weirdnesses  by  requiring  that  both  states  F(t)  =  c,  and  F(t)  >  c  have 
discrete  variation.  But  even  this  stronger  condition  does  not  allow  us  to  do  everything  we  want  to. 


For  example,  the  function  F(t)  =  t  and  G(t)  =  t  +  <3sin(l/<)  both  satisfy  this  condition.  (Note 
that  (7(f)  is  monotonically  increasing  in  a  neighborhood  of  t=0.)  However,  the  state  G(t)  >  F(t) 
has  clustered  variation  in  the  neighborhood  of  t  =  0.  We  would  like  to  be  able  to  perform  basic 
operations  on  our  fluents  without  worrying  about  getting  into  trouble. 

Luckily,  there  is  a  helpful  theorem  about  power  series.  First,  three  definitions: 

Definition  2:  A  non-empty  subset  /  of  the  real  line  is  an  interval  iff  it  satisfies  the  following 
conditions:  if  A'  6  /,  1'  €  /,  and  X  <  Z  <  Y  then  Z  £  I.  This  includes  open,  closed,  half-open, 
and  single-point  intervals,  bounded  or  unbounded. 

Definition  3:  A  function  f(x)  on  the  reals  is  analytic  at  point  xq  if 

i.  f(x)  is  continuous  and  infinitely  differentiable  at  xq\ 

ii.  The  Taylor  series 

f(x0)  +  (x-  x0)f'(x0)  +  i(z  -  x0ff"(x0)  +  ... 

converges  to  f{x)  for  all  x  in  some  neighborhood  of  io- 

(An  equivalent  definition  is  that  f(x)  is  extendable  to  a  differentiable  function  over  a  neighborhood 
x0  in  the  complex  plane.  Condition  (i)  alone,  that  f(x)  is  continuous  and  infinitely  differentiable, 
is  not  sufficient  to  guarantee  discrete  variation,  contrary  to  a  misstatement  in  [McDermott,  82].  An 
example  is  the  function  f{x)  =  e~xlT   sin(l/r)  for  x  /  0.  /(0)  =  0.) 

Definition  4:  A  function  /  is  analytic  over  an  interval  /  if  /  is  analytic  at  every  point  x  £  I. 

We  now  have  the  following  useful  theorems:  (The  proofs  of  these  may  be  found  in  any  standard 
text  on  infinite  series  or  complex  variables.) 

Theorem  5:  If  F  is  analytic,  then  the  states  F(t)  =  c  and  F(t)  >  c  vary  discretely. 

Theorem  6:  If/  and  g  are  analytic,  then  the  functions  f  +  g,  f -g,  fg  and  f(g)  are  all  analytic. 
The  integral  and  derivative  of/  are  analytic.  The  quotient  f(x)/g(x)  is  analytic  except  for  x  where 
g(x)  =  0.  The  power  fa(x)  is  analytic  for  all  x  if  a  is  a  non-negative  integer,  and  analytic  for  all  x 
where  f(x)  >  0  for  all  other  a.  The  inverse  of  /,  restricted  to  a  range  where  it  is  uniquely  defined, 
is  analytic  at  any  point  where  f'(x)  f^  0. 

Remark  7:  A  lot  of  good  functions  are  analytic  for  all  x,  including  polynomials,  exponentials, 
sine  and  cosine,  and  so  on.  The  logarithm  is  analytic  for  all  positive  x.  Most  real-valued  functions 
that  come  up  in  practice  are  analytic  for  almost  all  x. 

One  type  of  closure  that  cannot  be  achieved  is  closure  under  solving  differential  equations.  There 
are  differential  equations  that  do  not  superficially  look  badly  behaved  but  whose  solutions  include 
functions  with  clustered  variations.  An  example  is  the  initial-value  system. 

y  +  y2  =  0. 
x  +  yAx=  0. 
j/(-l/7r)  =  -tt;  x(-1/tt)  =  0;  x{-\/n)  =  -V. 

For  t  <  0,  this  has  the  solution  y(t)  =  l/i;  x(t)  -  tsin(l/t). 

The  class  of  analytic  functions,  though  large,  is  not  adequate  for  AI  applications,  which  need  to 
model  discrete  functions.  Often,  the  behavior  of  a  fluent  will  change  suddenly,  even  discontinuously, 
as  the  result  of  an  event  such  as  a  switch  flipping,  a  moving  object  colliding  with  an  obstacle,  a 
heating  pot  of  water  reaching  boiling  point,  and  so  on.  If  we  can  suppose  that  such  event  occur  only 
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at  discrete  times,  then  we  can  widen  the  class  of  functions  to  include  the  corresponding  fluents  as 
follows: 

Definition  8:  Let  fa,  fa  ...  be  a  collection  (finite  or  infinite)  of  real-valued  functions.  Let  P(f) 
be  a  function  that  associates  a  real  interval  with  each  /,.  Assume  that  P  has  the  following  two 
properties: 

i.   P  is  disjoint  and  exhaustive.  That  is,  for  any  real  x  there  is  a  unique  fa  such  that  x  €  P(fk)- 

ii.  P  varies  discretely.  That  is,  if  J  is  a  bounded  interval,  then  J  contains  P(f,)  for  only  finitely 
many  i. 

We  then  define  the  splice  of  P  as  the  function  s(x)  such  that  s(x)  =  fi(x)  for  all  x  6  P(fi)-  (Figure 

7) 

Note  that  P  may  have  infinitely  many  different  values,  as  long  as  it  varies  discretely. 

Definition  9:  A  function  s(x)  is  piecewise  analytic  if  there  is  exists  a  collection  /i ,  fa,  . .  .and 
a  function  P(f, )  such  that 

i.   P  satisfies  the  conditions  of  definition  8  above, 
ii.  s(x)  is  the  splice  of  P. 
iii.   Each  /,  is  analytic  over  an  open  set  that  contains  the  closure  of  P(f,). 

Note:  In  condition  (iii)  above,  it  is  important  that  /,  be  analytic  over  the  closure  of  P(fi),  not 
just  over  P(fi).  For  example,  the  function  f(x)  =  xsin(l/x)  is  analytic  over  the  open  interval  (0, 
oo),  but  not  at  0.  Hence,  the  function  defined  to  be  0  for  all  x  <  0  and  xsin(l/x)  for  all  x  >  0  is 
not  considered  piecewise  analytic. 

We  then  have  the  following  theorems: 

Theorem  10:  If/  is  piecewise  analytic,  then  the  states  f(x)  =  c  and  f(x)  >  c  vary  discretely. 

Theorem  11:  The  class  of  piecewise  analytic  functions  satisfies  all  the  closure  properties  enu- 
merated for  analytic  functions  in  theorem  6  (assuming  that  some  convention  is  adopted  for  defining 
the  derivative  at  points  where  the  function  is  not  differentiable.)  Also,  if  f(x)  and  g(x)  are  piecewise 
analytic,  then  ma.x(f(x),g(x))  and  min(f(x),g(x))  are  piecewise  analytic. 

The  proofs  are  immediate  from  theorems  5  and  6,  together  with  definitions  8  and  9. 

There  are  some  functions  that  might  be  useful  that  still  not  included  in  this  definition,  including 
divergent  functions,  like  l/x  or  sec(x),  and  fractional  powers,  such  as  y/x.  If  we  need  these,  we 
can  define  a  class  of  piecewise  Dirichlet  functions.  The  definition  of  these  is  rather  complex  and  is 
deferred  to  Appendix  A,  section  8.3.  Here,  we  adduce  the  following  results: 

•  If  f(x)  is  piecewise  Dirichlet  then  the  states  f(x)  =  c  and  f(x)  >  c  vary  discretely. 

•  All  analytic  functions  are  piecewise  Dirichlet. 

•  The  class  of  piecewise  Dirichlet  functions  is  closed  under  addition,  subtraction,  multiplication, 
non-zero  division,  and  differentiation,  assuming  a  convention  for  defining  the  derivative  at 
non-differentiable  points.  If  f{x)  is  piecewise  Dirichlet  in  T  and  is  positive  over  interval  /, 
then  fa{x)  is  piecewise  Dirichlet  for  any  real  a. 
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However,  we  lose  closure  under  integration  and  composition. 

By  theorem  1,  if  the  states  f(x)  =  c  and  f(x)  >  c  vary  discretely,  then  any  finite  Boolean 
combination  of  these  also  varies  discretely.  But  a  finite  Boolean  combination  of  inequalities  is  just 
a  finite  union  of  intervals.  Thus,  if  S  is  the  union  of  finitely  many  intervals,  then  the  state  f(x)  G  S 
varies  discretely  if  f(x)  is  piecewise  analytic  or  piecewise  Dirichlet. 

What  about  more  general  states  involving  the  value  of /(x)?  Clearly,  if  a  state  P(y)  has  clustered 
variation  as  a  function  oft/,  then  it  would  be  unreasonable  to  expect  P(f(t))  to  vary  discretely  as 
a  function  of  <;  even  such  a  well-behaved  function  as  f(t)  =  t,  would  violate  that  condition.  For 
example,  we  cannot  expect  the  state,  uf(t)  is  a  rational  number"  to  vary  discretely.  However,  it  is 
easily  shown  that  any  set  of  real  numbers  that  does  not  have  clustered  variation  is  the  union  of  a 
discretely  varying  collection  of  intervals.  We  have  already  taken  care  of  the  case  of  a  finite  collection 
of  intervals;  the  only  remaining  case,  therefore,  is  an  infinite  collection  of  discretely  varying  intervals, 
such  as  "/(<)  is  an  integer,"  or  "sin(/(i))  >  0. 

If  /  is  the  union  of  infinitely  many  intervals  with  discrete  variation,  and  f(x)  is  piecewise  Dirichlet 
and  bounded  in  every  finite  interval,  then  the  state  f(x)  G  /  varies  discretely.  If  f(x)  diverges  for 
finite  x,  then,  in  general,  the  state  f(x)  G  I  does  not  vary  discretely.  Thus,  a  state  such  as  "/(<)  is 
an  integer"  or  "sin(/(i))  >  0"  has  discrete  variation  for  functions  such  as  f(t)  =  t  or  f(t)  =  e' ,  but 
not  for  /(<)  =  \/t. 

We  run  into  problems,  though,  if  we  have  two  scales,  one  of  which  is  the  reciprocal  of  the  other. 
In  that  case,  an  infinite  sequence  of  discrete  intervals  in  the  one  corresponds  to  a  cluster  point  in 
the  other.  For  example,  consider  the  relation  on  tones,  "This  tone  is  a  C."  If  we  consider  the  series 
of  higher  and  higher  C-tones  in  octaves,  the  frequencies  form  a  discrete  series  of  numbers,  but  the 
wavelengths  are  clustered  around  zero;  if  we  consider  lower  and  lower  C-tones,  the  wavelengths  are 
discrete  but  the  frequencies  are  clustered  around  zero. 

It  is  true  that  both  are  discretely  spaced  on  a  logarithmic  scale.  However,  a  logarithmic  scale  can 
only  be  used  if  that  zero  and  negative  values  are  guaranteed  to  be  meaningless.  We  can  get  away 
with  this  if  we  are  using  the  scale  only  for  wavelengths,  but  if  we  wish  to  measure  wavelength  using 
the  same  scale  as  for  other  quantities  of  the  same  dimensionality  such  as  distances  or  coordinates, 
the  assumption  is  not  acceptable. 

Of  course,  whether  the  state  P(f(t))  has  discrete  or  clustered  variation  over  time  does  not  depend 
on  of  the  scale  used  to  measure  the  range  space  of  /.  (For  example,  whether  the  state  "The  string 
is  vibrating  at  the  tone  of  C"  varies  discretely  over  a  time  period  is  a  physical  fact,  which  does  not 
depend  on  whether  we  define  C  in  terms  of  frequency  or  wavelength.)  The  point  is  that  it  is  not 
sufficient  to  require  that  the  real-valued  fluents  be  well-behaved;  it  is  also  necessary  to  show  that 
the  significant  states  in  their  range  spaces  vary  discretely;  and  this  latter  assumption  is  not  one  that 
can  be  made  universally  about  every  quantity  space. 

We  can  show  the  above  closure  results  to  analyze  relations  among  multiple  fluents.  For  example, 
if  f(x)  and  g(x)  are  piecewise  analytic,  then  so  is  f(x)-g(x)  ,  by  the  closure  theorem  11.  Therefore, 
the  relation  f(x)  >  g(x),  which  is  equivalent  to  f(x)  -  g(x)  >  0,  must  vary  discretely. 

5.4      Spatial  Fluents 

Reasoning  about  shape  and  motion  raises  problems  that  are  analogous  to  those  of  real-valued  fluents, 
but  in  a  more  complex  setting.  We  wish  to  guarantee  that  the  times  in  which  a  moving  point  object 
are  inside  a  specific  region  vary  discretely;  that  the  times  in  which  two  moving  regions  overlap, 
or  abut,  or  are  within  a  given  distance  of  one  another  vary  discretely;  and  so  on.   Ensuring  these 

6Going  up  an  octave  doubles  the  frequency,  and  halves  the  wavelength. 

12 


conditions  requires  restricting  both  possible  shapes  and  possible  motions. 

As  in  the  one-dimensional  case,  we  wish  both  to  include  both  a  wide  range  of  possible  shapes  and 
motions,  and  to  attain  useful  closure  conditions.  However,  I  have  not  been  able  to  prove  discrete 
variation,  even  over  the  set  of  shapes  and  motions  generated  by  simple  kinematic  theories.  Suppose, 
that  we  wish  to  posit  that  all  physically  real  motions  and  shapes  are  drawn  from  a  set  of  "acceptable" 
motions  and  shapes.  Presumably,  this  set  will  have  to  satisfy  certain  closure  conditions.  It  would 
not  be  acceptable,  for  example,  to  propose  a  theory  in  which  a  machine  that  could  be  built  out  of 
acceptable  shapes  would  generate  a  motion  that  was  not  an  acceptable  motion.  A  modest  set  of 
closure  conditions  might  include  the  following: 

•  Uniform  translation  and  uniform  rotation  are  acceptable  motions. 

•  The  class  of  acceptable  motions  is  closed  under  composition.  That  is,  if  each  of  two  coordinate 
systems  is  undergoing  an  acceptable  motion  relative  to  a  third,  then  their  motion  relative  to 
each  other  is  acceptable. 

•  A  sphere  is  an  acceptable  shape. 

•  The  connected  components  of  the  normalization  of  the  union,  intersection,  or  set  difference  of 
two  acceptable  shapes  are  acceptable. 

•  If  an  acceptable  shape  undergoes  an  acceptable  motion  for  a  finite  time,  the  shape  that  is 
swept  out  is  acceptable. 

•  If  a  collection  of  acceptable  shapes  are  assembled  into  a  linkage  with  one  degree  of  freedom, 
and  one  point  in  the  linkage  has  an  acceptable  motion,  then  all  points  in  the  linkage  have 
acceptable  motions. 

Unfortunately,  I  have  not  been  able  to  show  that  the  class  of  shapes  and  motions  generated 
by  these  closure  conditions  satisfies  the  discrete  variation  condition.  The  strongest  result  I  have 
show  is  that  the  class  of  semi-algebraic  shapes  with  piecewise  analytic  motions  satisfies  the  discrete 
variation  condition,  but  this  class  is  not  closed  under  "sweeping  out".  If  the  motions  are  restricted 
to  be  algebraic,  then  the  closure  conditions  hold,  but  they  no  longer  include  uniform  rotation,  which 
is  trigonometric.  If  the  shapes  are  generalized  to  be  analytic,  then  I  have  not  been  able  to  prove 
discrete  variation  or  closure  conditions,  though  I  conjecture  that  both  hold. 

Some  partial  results  are  stated  below. 

Definition  12:  A  region  is  a  subset  of  5?*. 

Definition  13:  An  algebraic  constraint  is  a  constraint  of  the  form  f(x)  >  0  or  f(x)  >  0  where 
/  is  an  algebraic  (multinomial)  function.  A  semi-algebraic  region  is  the  solution  space  of  a  finite 
Boolean  combination  of  algebraic  constraints. 

Theorem  14:  Let  R  be  a  semi-algebraic  region,  and  let  x(t)  be  a  piecewise  analytic  function  of 
time.  Then  the  Boolean  fluent  x(t)  €  R  varies  discretely. 

Proof:  Let  R  be  defined  by  the  Boolean  combination  of  the  functions  /i ,  /2  ■  •  •/*•  The  functions 
/,(£(<))  are  piecewise  analytic  functions  in  t;  hence  each  constraint  /,(£(<)  >  0  varies  discretely,  by 
theorem  5.  Hence,  by  theorem  1,  the  Boolean  combinations  of  these  states  also  varies  discretely. □ 

Theorem  15:  (Informal  statement)  Consider  two  objects  that  are  moving  rigidly  through  space. 
If  the  shapes  of  the  objects  are  semi-algebraic,  and  the  motions  are  piecewise  analytic,  then  states 
such  as  "The  two  objects  overlap,"  "The  objects  abut,"  or  "The  distance  between  the  objects  is  less 
than  D"  vary  discretely.  This  result  can  be  extended  to  objects  that  move  non-rigidly,  as  long  as 
the  shape  transformations  are  characterized  by  algebraic  functions. 
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Appendix  A  contains  a  precise  statement  and  proof  of  theorem  15. 

Definition  16:  An  analytic  constraint  is  a  constraint  of  the  form  /(£)  >  0  or  /(£)  >  0  where  / 
is  an  analytic  function.  A  semi-analytic  region  is  the  solution  space  of  a  finite  Boolean  combination 
of  analytic  constraints. 

Theorem  17:  Let  R  be  a  semi-analytic  region,  and  let  x(t)  be  a  piecewise  analytic  function  of 
time.  Then  the  Boolean  fluent  x(t)  €  R  varies  discretely. 

Proof:  Identical  to  proof  of  theorem  14. 

Conjecture  18:  Consider  two  objects  that  are  moving  rigidly  through  space.  If  the  shapes  of 
the  objects  are  semi-analytic  and  the  motions  are  piecewise  analytic,  then  states  such  as  "The  two 
objects  overlap,"  "The  objects  abut,"  or  "The  distance  between  the  objects  is  less  than  D"  vary 
discretely. 

Another  issue  concerns  real-valued  functions  defined  over  space,  such  as  temperature  or  pressure. 
We  would  like  to  guarantee  that  the  values  of  such  functions  encountered  by  an  object  moving 
through  space  are  well-behaved.  We  can  achieve  this  with  a  generalization  of  the  notion  of  a  piecewise 
analytic  function: 

Definition  19:  Let  s(x)  and  fi(x),f-2{x) ...  be  functions  from  5?*  to  3?.  Let  P  be  a  function 
mapping  each  of  the  functions  fi  to  a  region  in  3f*  such  that 

a.  P  is  disjoint  and  exhaustive.  That  is,  for  each  x  €  3?  there  is  exactly  one  /,  such  that  x  G  P{fi)- 

b.  For  all  i,  P(fi)  is  a  semi-analytic  region  of  3?  . 

c.  Any  bounded  S  C  5?k  interesects  P(fi)  for  only  finitely  many  i. 

d.  fi  is  analytic  over  an  open  set  that  includes  the  closure  of  P(fi). 

e.  For  x€P(fi),s(x)  =  f,(x). 

Then  s(x)  is  said  to  be  piecewise  analytic. 

Theorem  20:  Let  x(t)  be  a  piecewise  analytic  function  from  time  to  Rk ,  and  let  s(x)  be  a 
piecewise  analytic  function  from  3?*  to  3f.  Then  s(x(t))  is  a  piecewise  analytic  function  of  time. 

Proof:  Immediate  from  the  definition  and  theorem  17. □ 

5.5      Finitely  Many  Objects 

If  the  universe  contains  infinitely  many  objects,  then  states  of  the  form  "There  exists  an  object 
with  property  P"  may  have  clustered  variation  even  if  each  individual  object  is  well-behaved.  For 
example,  suppose  there  is  an  infinite  collection  of  light  bulbs,  and  bulb  #1  is  on  from  t  =  0  to 
t  =  1/2,  bulb  #2  is  on  from  t  =  3/4  to  t  =  7/8;  bulb  #3  is  on  from  t  =  15/16  to  t  =  31/32  and 
so  on.  Then  the  state  "Some  light  bulb  is  on"  has  clustered  variation,  even  though  each  individual 
light  bulb  is  well-behaved. 

Often,  states  of  physical  significance  depend  only  on  objects  that  are  in  some  bounded  region.  In 
that  case,  it  suffices  to  demand  that  in  there  are  only  finitely  many  objects  in  any  bounded  region  of 
space-time.  Suppose,  modifying  the  above  example,  we  have  a  rule  that  a  room  is  illuminated  just 
if  it  contains  a  lighted  light  bulb.  In  that  case,  it  is  possible  to  guarantee  that  the  state  "The  room 
is  illuminated"  varies  discretely  by  positing  that  the  state  "Light  bulb  N  is  on"  varies  discretely  for 
each  N,  and  that  during  any  finite  time  interval  there  are  only  finitely  many  light  bulbs  in  the  room. 
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Note  that  this  condition  is  strictly  stronger  than  the  condition,  "At  any  instant  of  time,  there 
are  only  finitely  many  objects  in  any  spatially  bounded  region,"  which  does  not  suffice  to  guarantee 
the  desired  result.  Consider  the  following  example.  Syldavia  is  interested  in  the  state  "There  are  no 
enemy  aircraft  in  Syldavian  airspace."  Freedonia  has  an  infinite  fleet  of  airplanes,  regularly  spaced 
on  an  infinite  landing  strip.  At  time  t  =  0,  plane  1  flies  from  its  spot  on  the  landing  strip  to  Syldavia, 
returning  to  its  spot  by  t  =  1/2.  Plane  2  does  the  same  from  t  =  3/4  to  t  =  7/8,  and  so  on.  No 
individual  plane  is  doing  anything  extraordinary  (except  for  going  faster  than  the  speed  of  light), 
and  the  planes  can  all  be  of  a  standard  shape  and  size;  there  is  no  need  to  posit  that  the  plane  size 
is  gradually  decreasing,  or  anything  of  the  kind.  Nonetheless,  the  state  "There  are  no  aircraft  in 
Syldavian  airspace"  has  clustered  variation. 

Imposing  the  condition  of  only  finitely  many  objects  in  a  bounded  region  of  space-time  may,  in  its 
turn,  require  further  restrictions  on  processes  that  create  objects  or  that  send  objects  into  regions. 
For  instance,  if  we  replace  the  airplanes  of  the  previous  example  by  bullets  fired  from  guns  spread 
out  on  the  landing  strip,  with  further  guns  firing  faster  and  faster  bullets,  then  we  must  posit  that 
the  guns  do  not  fire  in  such  a  way  as  to  send  infinitely  many  bullets  through  a  finite  space  in  finite 
time.  This,  note,  is  a  restriction  on  processes  that  happen  over  an  infinite  spatial  region.  If  we  space 
the  guns  and  fix  the  velocities  carefully  enough,  we  can  even  work  it  so  that  the  further  guns  must 
fire  earlier  and  earlier;  in  that  case,  it  is  a  restriction  on  processes  that  occur  over  an  unbounded 
region  in  both  space  and  time.  (If  gun  /  is  at  x  —  24/  and  fires  a  bullet  in  the  negative  direction  at 
time  -22/  -  l/22/_1  at  speed  22/,  then  the  bullet  will  be  between  x  =  0  and  x  =  -1  between  the 
times  -l/22/_1  and  -l/22/.)  Of  course,  the  guns  here  are  superfluous;  one  can  imagine  that  the 
bullets  have  just  been  pursuing  these  trajectories  since  time  immemorial,  never  looking  in  the  least 
clustered  until  t  =  0.  All  in  all,  it  may  simply  be  best  to  posit  that  in  all  space-time  there  are  only 
finitely  many  objects. 

An  example  of  creating  objects:  suppose  that  the  bouncing  ball  of  section  1  is  slightly  brittle, 
and  that  each  time  the  ball  hits  the  ground  a  small  piece  breaks  off  whose  size  is  proportional  to 
the  energy  of  the  collision.  Then,  after  the  bouncing  ball  has  attained  a  state  of  rest,  the  floor  will 
be  littered  with  an  infinite  collection  of  diminishing  objects,  which  can  proceed  to  make  trouble. 

It  is  often  useful  to  require  that  only  finitely  objects  exist  in  an  bounded  spatial  region  for  reasons 
independent  of  temporal  considerations.  For  instance,  in  the  blocks  world,  one  would  like  to  infer 
that,  if  a  block  is  on  top  of  a  tower  of  blocks,  then  it  is  resting  on  some  particular  other  block;  this  is 
guaranteed  if  there  are  only  finitely  many  blocks  in  the  tower.  Similarly,  in  a  theory  of  perception, 
one  would  like  to  infer  that,  if  there  is  an  object  blocking  your  view  in  a  particular  direction,,  then 
there  is  a  frontmost  object  in  that  direction;  again,  this  is  guaranteed  if  there  are  only  finitely  many 
objects  in  the  neighborhood,  and  objects  are  reasonably  shaped.  (Figure  8) 


6      Ontology 

The  status  of  discrete  and  clustered  variation  depends  critically  on  the  way  in  which  the  universe  is 
divided  into  entities. 

One  example  was  alluded  to  in  section  3.  Suppose  I  have  a  vibrating  violin  string,  and  I  run  my 
finger  to  the  bridge  in  a  finite  time  in  such  a  way  that  the  part  of  the  string  between  the  finger  and 
the  bridge  vibrates  infinitely  often.  If  we  consider  "The  vibrating  part  of  the  string"  to  be  an  entity, 
then  the  state  "The  vibrating  part  of  the  string  is  to  the  left  of  center"  has  clustered  variation. 
However,  if  we  take  the  individual  points  on  the  string  as  primitive  entities,  then  these  are  all  well 
behaved.  Each  point  crosses  the  center  line  a  finite  number  of  times,  and  then,  being  on  the  far  side 
of  the  finger,  remains  still. 
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In  a  recent  paper  axiomatizing  a  theory  of  cutting  solid  objects  [Davis,  91a],  I  ran  into  this  issue 
in  an  extreme  form.  In  this  paper,  I  proposed  an  idealized  model  of  cutting  in  which  the  blade 
change  the  shape  of  the  target  by  simply  annihilating  the  material  that  it  swept  through.  There 
were  two  natural  approaches  to  axiomatizing  this  theory.  The  more  obvious  approach  is  to  view  the 
target  as  an  object  whose  shape  is  gradually  modified  by  the  blade  until  it  is  split,  at  which  time 
the  original  object  ceases  to  exists  and  two  new  objects  come  into  being.  In  this  theory,  there  are 
three  central  axioms  of  change: 

1.  If  an  object  exists  from  51  to  52,  then  its  shape  in  52  is  equal  to  its  shape  in  51  minus 
whatever  has  been  carved  out  of  it. 

2.  Object  O  ceases  to  exist  in  situation  5  if  and  only  if  its  shape  is  connected  up  until  5  and 
becomes  disconnected  in  5. 

3.  Object  O  comes  into  existence  in  situation  5  if  and  only  if  it  is  one  connected  component  of 
another  object  02  that  ceases  to  exist  in  5. 

These  axioms  serve  both  as  causal  axioms  and  as  frame  axioms.  The  first  axiom  is  causal  if  something 
has  been  carved  out  of  the  object,  and  is  a  frame  axiom  if  nothing  has.  The  second  and  third  axioms 
are  causal  in  the  direction  "Change  if  condition"  and  frame  axioms  in  the  direction  "Change  only  if 
condition." 

The  more  subtle  approach  takes  a  chunk  of  stuff  as  its  primary  type  of  entity.  There  is  one  chunk 
for  every  (reasonably  shaped)  subset  of  the  material  of  an  object;  thus,  there  are  uncountably  many 
chunks  at  any  instant.  A  chunk  has  a  fixed  shape.  It  ceases  to  exist  when  any  part  of  its  material 
is  penetrated  by  a  blade.  The  one  rule  of  change  in  this  theory  is  that  a  chunk  is  annihilated  if  and 
only  if  it  is  penetrated  by  a  blade.  (Figure  9). 

The  theory  of  objects  is  terribly  sensitive  to  clustered  variation.  Since  dramatic  discrete  changes 
occur  whenever  an  object  is  split,  the  prospect  of  infinitely  splits  occurring  in  finite  time  sends  the 
whole  theory  into  conniptions.  In  a  situation  like  figure  10. A,  where,  intuitively,  the  blade  cuts  an 
infinite  sequence  of  small  objects  on  the  right  off  the  large  block  at  the  left,  the  theory  absolutely 
refuses  to  admit  that  the  block  on  the  right  can  still  exist  when  the  blade  has  reached  the  bottom. 
An  object,  in  this  theory,  can  come  into  existence  only  be  being  split  off  a  previous  object;  and 
here  there  is  no  previous  objects,  only  a  blur  of  previous  objects.  (Keep  in  mind  that  the  object 
changes  identity  each  time  a  split  is  complete.)  Even  more  peculiarly,  a  blur  of  objects  can  appear 
out  of  thin  air  if  the  splitting  of  small  pieces  is  clustered  into  the  past  (Figure  10. B)  It  is  therefore 
necessary,  in  this  theory,  to  enforce  rigidly  the  condition  of  discrete  variation,  which,  in  turn,  means 
that  motions  and  shapes  must  be  well  behaved,  and  that  only  finitely  many  objects  are  allowed  in 
any  finite  piece  of  space-time. 

By  contrast,  none  of  these  anomalies  give  the  slightest  trouble  to  chunk  theory.  In  chunk  theory, 
splitting  looks  exactly  the  same  as  shape  deformation;  history  consists  purely  of  a  stream  of  chunks 
being  destroyed.  No  restrictions  on  clustered  variation  or  shape  or  motion  or  number  of  objects 
need  be  enforced  at  all.  In  fact,  since  "object"  is  not  a  first-order  concept  in  the  language  of  chunks, 
there  is  no  way  even  to  state  these  restrictions. 

The  two  theories  are  provably  equivalent  in  cases  that  obey  the  restriction  that  any  bounded 
region  of  space-time  contains  only  finitely  many  objects. 
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7      Conclusion 

Perhaps  the  whole  issue  still  seems  spurious  and  irritating.  However,  if  we  follow  the  conventional 
practice  in  knowledge  representation  of  characterizing  theories  rigorously,  and  the  conventional  prac- 
tice in  physical  reasoning  of  using  real-valued  time  and  space  and  using  simple  physical  approxi- 
mations, then  these  problems  are  unavoidable.  Those  researchers  who  have  tried  to  develop  formal 
theories  of  physical  reasoning  based  on  real-valued  quantities  will,  I  think,  agree  that  these  infinitary 
paradoxes  are  not  a  side  issue,  of  interest  only  to  those  with  a  prurient  taste  for  paradox.  Rather, 
developing  such  theories  is  like  designing  structures  for  the  area  around  the  San  Andreas  Fault. 
Making  a  structure  earthquake-proof  is  an  irritating  distraction  from  the  main  task  of  building  a 
useful  structure,  but  it  is  really  not  safe  to  ignore  the  issue.  Nor  is  there  a  simple  formula  that  can 
be  uniformly  applied;  there  are  some  general  techniques,  but  each  new  type  of  structure  must  be 
thought  through  carefully.  Some,  of  course,  would  argue  that  this  is  a  sign  that  one  should  build  in 
some  other  region.  But  no  region  is  perfect,  and  the  Bay  area  has  many  advantages:  the  climate, 
the  lifestyle  . .  .and  it's  where  the  action  is  happening. 


8     Appendix  A 

8.1      Definition  of  Clustered  Variation 

In  this  section,  we  present  two  formal  definitions  of  clustered  variation  and  prove  them  equivalent. 

Definition  21:  An  interval  /  is  a  zone  of  boolean  fluent  A  if  A  has  the  same  truth  value 
throughout  /.  /  is  a  maximal  zone  of  A  if  /  is  a  zone  of  A  and  no  interval  proper  containing  /  is  a 
zone  of  A. 

Lemma  22:  For  every  situation  S,  there  is  a  unique  maximal  zone  of  A  containing  S. 

Proof:  S  is  certainly  contained  in  at  least  one  zone  of  A,  namely  the  point  interval  [5,5].  Let 
/  be  the  union  of  all  zones  of  A  containing  S.  It  is  straightforward  to  verify  that  J  is  a  zone  of  A, 
and  that  every  other  zone  of  A  containing  5  is  a  subset  of  /.D 

Lemma  23:  If  SI  <  52  <  53,  and  the  truth  value  of  A  at  52  is  different  from  its  truth  values  at 
51  or  53,  then  there  is  no  zone  containing  both  51  and  53.  In  particular,  51  and  53  are  in  different 
maximal  zones. 

Proof:  Immediate  from  the  definition  of  zone. D 

Lemma  24:  If  A  has  the  same  truth  value  at  51  and  52,  and  51  and  52  are  in  different  maximal 
zones,  then  there  is  an  53  such  that  51  <  53  <  52  and  A  has  the  opposite  truth  value  at  53. 

Proof:  If  A  has  the  same  truth  value  at  all  points  between  51  and  52  as  it  does  at  51  and  52, 
then  the  interval  [51,  52]  is  a  zone,  so  51  and  52  would  be  in  the  same  maximal  zone.D 

Definition  25.  A:  A  Boolean  fluent  F  has  discrete  variation  if  any  finite  interval  /  contains  only 
finitely  many  maximal  zones.  Otherwise  it  has  clustered  variation. 

The  above  definition  intuitively  corresponds  to  the  desired  meaning  of  clustered  variation.  How- 
ever, it  is  phrased  in  high-order  terms,  posed  in  terms  of  the  cardinality  of  a  set  of  intervals,  each 
interval  being  a  set  of  situations.  This  may  make  it  difficult  for  the  human  reasoner  to  use,  and  even 
more  difficult  for  the  automated  reasoner. 

An  alternative  equivalent  definition,  expressible  purely  in  terms  of  situations,  can  be  formulated: 

Definition  25. B  Fluent  A  has  discrete  variation  if,  for  each  situation  5,  A  has  a  constant  truth 
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value  throughout  some  open  interval  (51,5)  ending  in  5  and  a  constant  truth  value  throughout 
some  open  interval  (5,52)  starting  in  5.  Note  that  the  condition  "A  has  a  constant  truth  value 
throughout  the  interval  (SP,SQ)"  can  be  expressed  purely  in  terms  of  situations. 

constant^,  5P,5<2)  « 

[SP  <  SQ  A  Vsx.SY  SP  <  SX  <SY  <SQ  =>  [holds(5A'M)  «•  holds(5Y,.4)] 

discrete(.4)  O 

[Vs3si,S2  constant^,  51, 5)  A  constant^,  5, 52)] 

We  will  want  the  following  lemma  in  proving  the  two  definitions  equivalent. 

Lemma  26:  Let  T  be  a  infinite  totally  ordered  set.  Then  it  is  possible  to  construct  either  an 
infinite  increasing  sequence  S\  <  52  <  53  . . .  or  an  infinite  decreasing  sequence  Si  >  S2  >  S3  . . . 
where  each  Si  €  T. 

Proof:  One  of  two  things  must  be  true  of  T 

a.  There  is  a  non-empty  subset  To  of  T  that  has  no  largest  element. 

b.  Every  non-empty  subset  of  T  has  a  largest  element. 

In  case  (a),  let  5i  be  any  element  of  T0;  let  52  be  any  element  of  7b  that  is  larger  than  5i  .  .This 
is  an  infinite  increasing  sequence. 

In  case  (b),  let  50  be  the  largest  element  of  T  and  let  Ti  =  T  -  {50}.  Let  5i  be  the  largest 
element  of  Tu  and  let  T2  =  T\  -  {S\}.  In  general,  let  5,  be  the  largest  element  of  T,  and  let 
Ti+i  =  7,  -  {5, }.  Then  the  sequence  50,  S\ ,  52  . . .  is  an  infinite  decreasing  sequence. □ 

Theorem  27:  Definitions  25. A  and  25. B  are  equivalent. 

Proof:  Part  1:  Suppose  that  fluent  Q  is  clustered  by  definition  B.  Let  5  be  a  situation  where 
the  discreteness  condition  does  not  hold.  Assume  without  loss  of  generality  that  there  is  no  interval 
below  5  where  Q  is  constant.  Then  any  open  interval  ending  in  5  contains  both  situations  in  which 
Q  is  true  and  situations  in  which  Q  is  false.  Let  S\  be  a  situation  before  5  in  which  Q  is  true.  Let 
52  be  a  situation  in  (5i,5)  where  Q  is  false.  Let  53  be  a  situation  in  [Si,S)  where  Q  is  true.  In 
general,  let  5,  be  a  situation  in  (5,_i,5)  where  Q  has  the  opposite  truth  value  than  it  had  in  S,-\. 
By  lemma  23,  each  of  the  situations  51,52, .. .  must  lie  in  a  different  maximal  zone.  Hence  there 
are  infinitely  many  maximal  zones  in  the  finite  interval  (51,5).  So  Q  is  clustered  by  definition  A. 

Part  2:  Suppose  that  fluent  Q  is  clustered  by  definition  A.  Let  [51,52]  be  a  finite  interval 
containing  infinitely  many  maximal  zones.  From  each  maximal  zone  Z,  choose  a  point  Sz  €  Z.  By 
lemma26,  we  can  either  choose  an  infinite  increasing  sequence  or  an  infinite  decreasing  sequence  from 
the  5z's;  assume,  without  loss  of  generality,  that  we  have  an  increasing  sequence  S\  <  S?  <  S3  ■  ■  ■, 
where  no  two  of  the  5;'s  are  from  the  same  zone.  By  lemma  24,  if  Q  has  the  same  truth  value  in 
5,  and  Sj+i,  we  can  find  a  situation  Ti  between  5,  and  Si+i  where  Q  has  the  opposite  truth  value. 
Interpolating  these  T,  where  necessary,  we  get  a  increasing  sequence  of  points  with  alternating  truth 
values  of  Q.  Since  all  these  points  are  less  than  52,  the  sequence  must  converge  to  some  value  5. 
Thus,  any  open  interval  bounded  above  by  5  must  contain  both  points  where  Q  is  true  and  points 
where  Q  is  false,  so  Q  is  clustered  by  definition  B.D 

8.2      Analysis  of  the  examples  of  clustered  physical  behavior  in  section 
3 

Example  2:  Let  B  be  a  spherical  ball  of  radius  r0.  Let  F  be  a  funnel  whose  inner  surface  is  smooth 
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and  radially  symmetric  about  the  z-axis,  such  that  the  minimum  radius  of  a  horizontal  cross  section 
is  ro.  Let  us  call  this  cross-section  of  minimal  radius  the  "mouth"  of  F.  Suppose  that  B  is  dropped 
without  spinning  onto  the  surface  of  F.  We  wish  to  show  that  B  will  reach  the  mouth  of  F  in  finite 
time  after  bouncing  infinitely  many  times. 

Proof:  Since  the  radius  at  the  mouth  is  minimal,  it  follows  that  F  is  tangent  to  a  vertical 
cylinder  along  its  mouth.  Therefore,  if  B  is  inside  the  mouth  of  F  at  time  t,  its  velocity  at  t  must  be 
directly  vertical.  However,  an  object  moving  on  a  parabolic  curve  is  never  moving  vertically,  unless 
it  is  dropped  from  directly  above,  and  an  object  rolling  or  sliding  on  a  curved  surface  will  always 
"fly  off'  before  the  surface  becomes  vertical.  Therefore,  the  ball  cannot  reach  the  mouth,  either  in 
any  single  bounce  that  begins  on  the  surface  of  F,  or  in  rolling  along  F.  Therefore,  it  can  only  reach 
the  mouth  after  an  infinite  collection  of  bounces  on  F . 

We  can  show  that  it  eventually  does  reach  the  mouth,  by  showing  that  there  is  no  way  it  can 
indefinitely  support  itself  above  the  mouth.  There  is  no  position  of  static  equilibrium  of  the  ball  in 
the  funnel.  If  we  assume  that  objects  can  roll  perfectly  without  loss  of  energy,  then  there  might  be 
a  state  of  dynamic  equilibrium  where  the  ball  is  rolling  around  the  center  of  the  funnel  and  using 
the  centrifugal  force  to  keep  itself  up.  However,  since  the  starting  situation  has  reflective  symmetry 
in  the  plane  that  contains  the  center  of  the  ball  and  the  vertical  axis  of  the  funnel,  all  subsequent 
situations  must  have  the  same  symmetry.  In  particular,  the  ball  cannot  develop  a  velocity  around 
the  axis  of  the  funnel.  No  other  possibilities  remain. □ 

Example  3:  Consider  an  object  of  mass  M  moving  to  the  right,  and  a  ball  of  mass  m  bouncing 
between  the  object  and  a  fixed  wall.  Suppose  that  the  collisions  between  the  wall  and  the  ball  and 
between  the  object  and  the  ball  are  partially  elastic  with  coefficient  of  restitution  /i.  For  simplicity, 
we  assume  that  both  object  and  ball  move  without  friction,  so  that  the  only  loss  of  energy  is  due  to 
the  collisions  between  the  ball  and  the  object.  However,  the  same  results  hold  if  the  ball  is  bouncing 
between  two  moving  large  objects,  and  if  there  is  sufficiently  small  loss  to  friction. 

Let  v,  be  the  velocity  of  the  object  before  the  i  +  1st  collision  between  object  and  ball;  z;  be 
the  velocity  of  the  ball  after  the  i  collision  of  the  ball  with  the  wall  and  before  the  t  +  1st  collision 
of  the  ball  with  the  object,  and  wi+l  be  the  velocity  of  the  ball  after  the  i  -f  1st  collision  with  the 
object  and  before  the  i  +  1st  collision  of  the  ball  with  the  wall  (Figure  11).  All  these  velocities  are 
measured  with  positive  direction  left  to  right,  so  Wi  is  positive  and  2,  is  negative.  Let  Ui  =  —Zi  be 
the  (positive)  speed  of  the  ball  moving  to  the  left  before  the  i  +  1st  collision  with  the  object.  Then, 
before  the  i  +  1st  collision,  the  center  of  mass  has  velocity  c,  =  (Mt,  +  mzi)/(M  +  m).  Relative 
to  the  center  of  mass,  the  object  is  moving  with  velocity  v,  -  c,  =  m(vi  -  z,)/(M  +  m);  the  ball  is 
moving  with  velocity  zt  -  c,  =  M(z{  -  v,)/(  A/  +  m).  The  inelastic  collision  leaves  the  velocity  of  the 
center  of  mass  unchanged,  and  multiplies  the  velocities  of  the  object  and  ball  relative  to  the  center 
of  mass  by  a  factor  of  —  n. 

Thus,  Vi+i  - c,  =  -fim(vi  -  Zi)/(M  +  m);  u>,-+1  -  c,  =  -/iA/(z,  -  v,)/(M  +  m). 

The  collision  of  the  ball  with  the  wall  multiplies  its  velocity  by  a  factor  of  —  /*,  so  r,+i  =  -fiWi+i 
=  -p(ci  -  fiM(zi  -  Vi)/(M  +  m)). 

Putting  these  together  we  get 

vl  +  i  =  [{M  -  (im)vi  +  (1  +  n)mz,]/(M  +  m)  and 
zi+l  =  /i[-(l  +  n)Mvt  +  (piM  -  m)zi]/(M  +  m). 

Rewriting  in  terms  of  u  and  v 

i>,-+i  =  [(A/  -  nm)vi  -  (1  +  n)mui]/{M  +  m)  and 
ul+1  =fi[(l  +  fi)Mvi  +  {nM  -  m)u,]/(M  +  m). 
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Note  that  w,-+j  continues  to  be  positive  as  long  as  [(M  —  fim)vi  —  (l+/i)m«i]  >  0,  or,  equivalently, 
v,/uj  >  (1  +  fi)m/(M  -  fim). 

The  above  formulas  for  u,+i  and  Uj+i  express  them  as  linear  functions  of  v,  and  u,,  where  the 
coefficients  are  determined  by  the  constants  M ,  m,  and  //.  Suppose  we  can  find  values  of  vo  and 
uq  such  that,  for  some  a  £  (0,  1),  i>i  =  avo  and  u\  —  quq.  Then,  since  the  velocities  are  uniformly 
reduced  by  a,  we  will  have  v?  —  Qt'i,  U2  =  cuii  and,  in  general,  vt  =  a'vo,  u,  =  a'uo-  Thus, 
infinitely  many  bounces  will  occur,  with  the  velocities  approaching  zero  in  geometric  series. 

We  are  thus  looking  for  a  solution  to  the  equation  a  =  fi/t»o  =  vq/uo  or  equivalently  v^iio  =  uiVq- 
Using  the  above  formulas,  and  multiplying  through  by  M  +  m,  we  get 

[(A/  -  fitn)v0  -  (1  +  n)mu0]u0  =  ^[(1  +  n)Mv0  +  (fiM  -  m)u0]v0 
Simplifying,  we  have 

m(l  +  n)ul  +  (/T  -  l)A/u0uo  +  fi(l  +  p)Mvl  =  0 

o  in       ,, 

y.vQ  +  (1  +  ^)u0v0  +  "TTA'Wo  =  0 

Let  0i  =  Vi/ui.  Then,  solving  the  above  equation, 


Po  =  ^ 

2/; 

The  above  equation  is  solvable  as  long  as  the  determinant  (1  —  fi)2  —  4n(m/M)  >  0;  it  has  two 
roots  if  this  inequality  is  strict.  But  this  condition  always  holds  if  either  p  or  m/M  is  sufficiently 
small. 

Thus,  we  have  shown  that,  if  n  is  small  enough,  then  there  are  two  particular  ratios  Pi  and 
0u  of  velocities  that  remain  constant  after  each  iteration.  We  can  show  further  that,  in  fact,  the 
original  ratio  does  not  have  to  correspond  exactly  to  these  values;  as  long  as  the  original  ratio  vq/uq 
is  greater  than  the  lower  of  the  two  roots  0i,  the  sequence  of  ratio  w,-/«j  will  converge  to  the  greater 
of  the  roots  0y .  To  show  this  let  us  return  to  the  above  equations  for  Vi+\  and  u,-+i.  Dividing  one 
equation  by  another,  and  simplifying  gives  us 

Q       _     (M-nm)0j  -(l  +  /i)m 
P,  +  1       /i[(l+/i)A//?,  +  (/iM-m) 

This  defines  0i+i  as  a  function  of  0,,  /?1+1  =/(/?,-).  The  graph  of  the  function  is  a  right  hyperbole, 
with  horizontal  and  vertical  asymptotes,  whose  right-hand  branch  is  concave  downward  (Figure  12). 
We  are  interested  in  cases  where  the  right-hand  branch  intersects  the  line  0,+\  =  /?,-.  Now  consider 
the  sequence  a,  /(a),  /(/(a))  ...This  sequence  may  be  generated  by  starting  at  the  point  on  the 
hyperbole,  with  0,  —  a,  drawing  a  horizontal  line  to  meet  the  line  0,+\  =  0i,  and  drawing  a  vertical 
line  from  the  intersection  point  to  the  hyperbole.  It  is  geometrically  obvious  and  easily  shown 
analytically  that,  if  a  starts  between  the  two  intersection  points  0i  and  0u ,  then  the  sequence 
converges  upward  toward  0u ,  while  if  a  starts  above  0u ,  then  it  converges  downward  toward  0u . 

Finally,  we  must  make  sure  that  the  time  between  bounces  converges  toward  zero  in  geometric 
ratio.  Let  di  be  the  "slack  space"  at  the  time  of  the  i-th  collision;  that  is,  the  distance  from  the  object 
to  the  wall  minus  the  diameter  of  the  ball.  Let  <,  be  the  time  between  the  ith  and  i  4-  1st  collisions. 
Between  the  ith  and  the  i+  1st  collisions,  the  object  covers  a  distance  di  —  d,+i  going  at  speed  Vi;  the 
ball  goes  the  distance  di  from  the  object  to  the  wall  at  speed  Ui/n  and  the  distance  di+ifrom  the  wall 
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to  the  object  at  speed  U{.  Therefore,  we  have  ti  -  (di  -  di+i)/vt  =  (/id,  +d1+i)/u;.  Solving  the  last 
equation  gives  d1+1/<f,  =  (u,  - (iv,  )/(u(  +  Vi).  In  order  that  the  times  may  diminish  geometrically,  we 
must  ensure  that  t,+l/ti  <  c  <  1.  This  condition  is  guaranteed  if  di+i/di  <  cvi+i/vi.  Substituting 
the  above  equation,  we  get  (u<  —  pvi)/(ui  +  v,)  <  cv,+i/t),-.  Assuming  that  i>i/u,  has  converged  to 
the  value  0,  as  discussed  above,  we  have 

(1  -  /*/?)/(  1  +  /?)  <  vi+i/vi  =  {(M  -itm-  (1  -  n)m/0\/M  +  m 

Simplifying,  we  get  (1  +  fi)02 M  >  (1  +  2/?-  /i)m.  From  above,  we  know  that  /?y  >  (1  - /j)/2/i, 
so  this  constraint  will  be  satisfied  as  long  as  fi  is  sufficiently  small. Q 

8.3      Discrete  variation  in  real-valued  functions 

In  this  section,  we  define  the  class  of  piecewise  Dirichlet  functions,  show  that  they  give  discrete 
variation  for  the  states  f(x)  >  c  and  f(x)  =  c,  and  show  that  they  are  closed  under  arithmetical 
operations,  exponentiation,  and  differentiation. 

Definition  28:  A  Dmchlel  series  is  a  series  of  the  form  Yl'tLo  aixX'  where 

i  A,  <  A^+1  for  all  i\ 
ii.  A,  goes  to  infinity  as  i  goes  to  infinity;  and 
iii.  the  series  converges  absolutely  for  some  x  >  0. 

(Remark  on  standard  terminology;  the  Dirichlet  series  is  more  usually  defined  in  terms  of  a 
variable  u  =  —  logx.  Under  that  variable  transformation,  the  series  takes  the  form  £2a,e-A,u.  Also, 
the  standard  definition  of  a  Dirichlet  series  requires  only  convergence,  not  absolute  convergence.) 

Examples: 

i.  The  finite  series  x-1  is  a  Dirichlet  series. 

ii.  The  series  x1  +  x^  +  x^  ...  is  a  Dirichlet  series.  To  show  that  it  converges  absolutely  in  the 
interval  [0,1),  observe  that,  for  x  <  1,  the  first  through  third  terms  are  less  than  or  equal  to 
x;  the  fourth  through  eighth  terms  are  less  than  or  equal  to  x2  . .  .in  general,  the  terms  in  the 
k~  through  (it  +  l)2  -  1  terms  are  less  than  or  equal  to  xk .  Therefore,  the  series  as  a  whole  is 
bounded  above  by  22x  +  32x2  +  42x3  . . .,  which,  by  the  ratio  test,  converges  absolutely  in  the 
interval  [0,1). 

iii.  The  series  cotangent(x)=  x"1  -  x/3  +  x3/45  ...  is  a  Dirichlet  series. 

iii.  The  series  sin(l/x)  =  x"1  -  x_3/3!  +  x-5/5! ...  is  not  a  Dirichlet  series,  since  the  exponents 
decrease  rather  than  increase. 

iv.  The  series  x1/2/l!  +  x3/4/2!  +  x7/8/3! ...  is  not  a  Dirichlet  series,  even  though  it  converges  for 
all  x  >  0,  since  the  exponents  approach  1  rather  than  oo. 

Definition  29:  The  function  /(x)  is  Dirichlet  above  x0  if  there  is  a  Dirichlet  series 

oo 

Y,  a,(x-x0)A' 
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that  converges  absolutely  to  /(x)  in  some  open  interval  (xo.xi).  /(*)  is  Dirichlet  below  xo  if  there 

is  a  Dirichlet  series 

oo 

]T  ai(x0-x)x- 

i  =  0 

that  converges  absolutely  to  f(x)  in  some  open  interval  (xi,xo). 

Lemma  30:  If  f(x)  is  analytic  at  xo,  then  f(x)  is  Dirichlet  above  and  below  xo- 

Proof:  Immediate  from  the  definitions. D 

Theorem  31:  If  f(x)  is  Dirichlet  above  xo,  with  a  series  that  converges  in  the  interval  (xo,xi), 
then  f(x)  is  analytic  in  the  interval  (xq,Xi).  (Note:  the  converse  does  not  hold.) 

Proof:  Let  xc  be  any  point  in  (xo,Xi).  Each  term  of  the  Dirichlet  series  a,(x  — x0)A'  is  analytic  in 
the  neighborhood  of  xc  and  therefore  can  be  written  as  a  power  series  £^  dij  (x  —  xc  )J  that  converges 
absolutely  in  the  neighborhood  of  xc.  We  can  therefore  write 


oo       oo 


t  =  0   ;'=0  j=0   i=0 

Since  the  first  sum  above  is  the  absolutely  convergent  sum  of  absolutely  convergent  series,   the 
rearrangement  of  the  order  is  correct. □ 

Definition  32:  A  function  /(x)  is  Dirichlet  in  the  open  interval  (xo,xi)  if/  is  Dirichlet  above 
xo,  Dirichlet  below  X\,  and  analytic  throughout  (xo.Xi).  The  function  /(x)  is  Dirichlet  in  an  interval 
/  that  is  half-open  or  closed  if  /(x)  is  Dirichlet  in  some  open  interval  containing  /. 

Example:  The  function  /(x)  =  l/x(x  -  1)  is  Dirichlet  in  the  interval  (0,1). 

Definition  33:  Let  f\,fa...  be  a  collection  of  functions  and  let  P  be  a  mapping  such  that 

i.  The  conditions  of  definition  8  are  satisfied, 
ii.  /,(x)  is  Dirichlet  in  the  interval  P(fi). 

Then  the  splice  of  P  is  said  to  be  piecewise  Dirichlet. 

Observation:  Any  Dirichlet  series  can  be  rewritten  in  normalized  form  Cx^(l+CiX°,+C2X0'3  . . .) 
where  0  <  cri  <  a?  <  . . .  by  factoring  out  the  leading  term.  We  will  call  Cx^  the  leading  factor, 
and  call  1  +  cix°'  +  C2X°3  . . .  the  series  factor.  It  is  easily  shown  that  the  series  factor  approaches 
1  as  x  goes  to  zero. 

Lemma  34:  If  /(x)  is  Dirichlet  above  x0,  then  there  is  an  interval  (x0,xi)  in  which  /(x)  is 
either  always  positive,  always  negative,  or  always  zero. 

Proof:  Put  the  Dirichlet  series  in  normalized  form,  as  above.  Since  the  series  factor  converges 
to  1,  we  may  choose  an  i  small  enough  that  1  +  C\tai  +  c2ta*  ...  is  greater  than  0  for  all  t  €  (0,t). 
Then,  for  x  €  (xO,  xO  +  e),  the  series  factor  is  always  positive,  and  the  leading  factor  always  has  the 
sign  of  the  constant  coefficient  C,  so  f(x)  has  the  same  sign  as  C  throughout  (xO,  xO  +  ( ).□ 

Theorem  35:  If  /(x)  is  piecewise  Dirichlet,  then  it  has  discrete  variation. 

Proof:  Immediate  from  Lemma  34  and  Theorem  5.D 

Lemma  36:  If  /(x)  and  g(x)  are  both  Dirichlet  above  xo,  or  both  are  Dirichlet  below  xo,  then 
so  are  f(x)  +  g(x),  f(x)  -  g(x),  and  /(x)  •  g(x).  The  functions  g(x)/f(x),  is  Dirichlet  above  (below) 
xo  unless  /(x)  is  identically  zero  in  an  interval  above  (below)  x0.   The  function  /(x)°  is  Dirichlet 
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above  (below)  x0  if /(^o)  >  0  and  f(x)  is  not  identically  zero  in  an  interval  above  (below)  xq-  The 
derivative  /'(x)  is  Dirichlet. 

Proof:  In  each  case,  the  new  formal  series  is  easily  calculated  from  the  old  series,  and  absolute 
convergence  follows  from  standard  theorems. 

Addition  and  subtraction:  Add/subtract  term  by  term.  If  two  series  converge  absolutely,  then 
their  sum  does  as  well.  [Knopp,  p.   138] 

Multiplication:  Multiply  term  by  term.  If  two  series  converge  absolutely,  then  their  product  does 
as  well  in  the  smaller  of  the  two  intervals  of  convergence.  [Knopp,  p.   146] 

Exponentiation:  Express  f(x)  in  normal  form:  f(x)  =  Cx0(\  +  Cix°'  +  c-ixa'i  . . .)  where  C  ^  0. 
Suppose  that  /  converges  absolutely  in  the  interval  (0,r).  We  have 

fi(x)  =  C  i*0  (1  +  cix"'  +  c2xa>  . . .)« 

Let  z  =  (cix°'  +  cii"3  . . .)  Note  that  z  goes  to  0  as  x  goes  to  zero.  Then,  by  the  binomial  theorem, 
the  third  factor  above  has  the  form 

(l  +  .-)<  =  l  +  ?r  +  fcih2  +  ... 

Moreover,  this  series  converges  absolutely  for  |  r  |<  1  [Knopp,  p.  208].  Since  :(x)  goes  to  zero  when 
x  goes  to  zero,  this  condition  holds  for  small  enough  z. 

By  the  remark  on  multiplication  above,  each  power 

z(x)k  =  (cix°'  +  c3i°!  ...)•...    (cix0"  +  cnx"2  ...)  (k  times) 

can  be  expanded  as  a  series  in  x,  which  converges  absolutely  within  the  same  radius  as  z(x).  Substi- 
tuting these  in  the  above  series,  we  have  the  term  factor  of  /(x)'  expressed  as  a  sum  of  terms  each 
of  which  is  a  power  series  in  x.  Two  points  remain  to  be  established: 

1.  If  x  is  chosen  small  enough  so  that  the  series  for  z(x)  converges  absolutely  to  a  value  less 
than  1,  then  the  overall  series  is  the  absolutely  convergent  sum  of  a  collection  of  absolutely 
convergent  series,  and  hence  converges  absolutely. 

2.  We  must  show  that  the  series  can  be  rearranged  into  descending  powers  of  x  that  go  to  minus 
infinity.  An  equivalent  condition  is  that  for  any  M,  there  are  only  finitely  many  terms  x6 
where  6  <  M .  This  is  immediate,  since  the  smallest  exponent  in  the  expansion  of  r(x)  is  ken. 
Hence,  powers  of  x  that  are  less  than  M  are  generated  only  from  series  earlier  than  ;(x)    '"'  .□ 

Quotient:  Rewrite  g(x)/f(x)  as  g(x)f(x)~1  and  apply  the  above  results  on  multiplication  and 
exponentiation. 

Differentiation:  Differentiate  term  by  term.  Convergence  is  immediate  from  the  ratio  test.D 

Theorem  37:  If /(x)  and  g(x)  are  piecewise  Dirichlet,  then  then  so  are  f(x)  +  g(x),  f(x)-g(x), 
f(x)  ■  g(x).  The  functions  g(x)/f{x),  and  f(x)a  are  piecewise  Dirichlet  unless  /(x)  is  identically 
zero  over  some  finite  interval.  The  derivative  f'(x)  is  piecewise  Dirichlet,  assuming  some  convention 
for  defining  the  derivative  at  points  where  the  function  is  not  differentiable. 

Proof:  Immediate  from  lemma  36  and  theorem  6.0 

Still,  there  are  well-behaved  functions,  like  log(x)  or  e_1/r\  that  are  not  representable  by  Dirich- 
let series  at  x  =  0.  It  can  be  shown  that  there  are  larger  classes  of  functions  that  contains  these, 
as  well  as  all  the  piecewise  Dirichlet  functions;  that  is  closed  under  the  operations  of  theorem  35; 
and  that  is  guaranteed  to  vary  discretely.  However,  the  description  of  this  set  and  proof  of  these 
properties  is  so  long  and  of  such  limited  interest  that  I  have  not  included  them  here. 
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8.4      Spatial  Fluents 

In  this  section  we  prove  theorem  15  from  section  4.4.  Informally,  the  theorem  states  that,  in  a 
situation  where  all  objects  have  semi-algebraic  shapes  and  piecewise  analytic  motions,  any  Boolean 
fluent  that  can  be  stated  in  an  algebraic  language  varies  discretely.  The  formal  statement  of  this 
result  is  given  as  theorem  44  below.  Viewed  from  the  proper  perspective,  this  result  is  an  almost 
trivial  consequence  of  a  classic  result  of  Tarski  [1951],  stating  that  the  solution  space  for  any  open 
formula  in  an  algebraic  language  is  a  semi-algebraic  set  (theorem  42  below),  combined  with  theorem 
17.  This  section,  thus,  consists  of  setting  up  that  perspective. 

A  position  of  an  object  in  n-dimensional  space  can  be  characterized  in  terms  of  a  mapping  from 
3?"  to  3?n.  (If  the  object  is  rigid,  then  the  mapping  is  orthonormal.)  A  motion  of  the  object  through 
time  is  therefore  a  function  from  time  to  mappings,  or,  in  other  words,  a  mapping  from  3?1  x  9?" 
to  3?".  It  will  be  convenient  to  consider  motions  that  are  governed  by  a  collection  of  real- valued 
parameters,  p\  .  ■  -fk-  Each  of  these  parameters  will  eventually  be  viewed  as  a  function  of  time,  but 
initially  it  will  be  useful  to  consider  them  as  independently  varying  parameters.  We  can  therefore 
adopt  the  following  definition: 

Definition  38:  A  k-parameter  motion  in  n- dimensional  space  is  a  mapping  from  3J  x  5?"  to 
3f".  If  M  is  such  a  mapping,  we  write  M{p\  . .  -Pk,x)  =  y 

Definition  39:  A  mapping  M  from  5fa  to  J?6  is  algebraic  if  each  coordinate  in  M(£)  is  an 
algebraic  function  (multinomial)  of  the  coordinates  of  x.  (It  is  easily  shown  that  whether  or  not  a 
mapping  is  algebraic  is  independent  of  the  coordinate  system. 

Definition  39  is  applied  to  ^-parameter  motions  by  taking  a  to  be  k  +  n  and  6  to  be  n. 

Example:  Let  n  =  3  and  k  =  9.  The  it-parameter  motion 

M(x)  = 


Pi 

P2 

P3 

X\ 

Pa 

P5 

P6 

x-i 

P7 

PS 

P9  . 

.  X3 

is  algebraic. 

For  any  fixed  value  of  the  parameters,  a  ^-parameter  motion  defines  a  function  from  3?"  to  5?n. 
This  is  extended  in  the  natural  way  to  define  a  mapping  from  n-dimensional  regions  to  n-dimensional 
regions.  If  R  is  a  region  in  3?n,  and  M  is  a  fc-parameter  motion,  we  write  "M(pi  . .  .pt,  R)"  to  mean 
the  image  of  R  under  the  mapping  with  the  particular  parameter  values  p\  . .  .p*. 

Definition  40:  The  language  of  real  algebra  is  the  first-order  language  with  equality  and  with 
the  non-logical  primitives  '0',  '1',  'plus(A",  Y)\  'times(AT,  Y)\  and  'greater(A',  Y)\  where  the  domain 
is  taken  to  be  the  real  numbers  and  the  primitives  are  given  their  usual  interpretation. 

Definition  41:  Let  Rj  .  ..R„,  be  m  regions  in  3?".  A  quantification  over  R,  is  a  restricted 
quantification  of  the  form  Q<x  x  >eR  ,  where  Q  is  either  the  universal  or  existential  quantifier. 
The  property  4>(Ri  . .  .Rm)  is  an  algebraic  property  of  the  regions  if  it  is  equivalent  to  a  formula 
written  in  the  language  of  real  algebra,  together  with  quantifications  over  the  R^. 

Examples: 

i.  The  property  "R  and  S  overlap"  is  algebraic,  since  it  can  be  written  (in  two  dimensions)  in 
the  form 

3<w.A>eR  3<v,z>eS  W  =  YAX  =  Z 

ii.  The  property  "The  minimum  distance  from  R  to  S  is  equal  to  D"  is  algebraic,  since  it  can 
be  written  (in  two  dimensions)  as 
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[V<^eR  V<v,z>eS  (^  -  n  ■  (^  -  >')  +  (X  -  Z)  •  (X  -  Z)  >  D  •  D]A 
We>d  3<vv,x>€r  3<y,z>€s  (^  -  y)    (W-Y)  +  (X-Z)(X-Z)<EE] 
(The  subtractions  above  can  easily  be  expressed  in  terms  of  plus.) 

iii.  The  property  "The  volume  of  R  is  greater  than  3,"  is  not  an  algebraic  property.    Note  that 
the  basic  definition  of  volume  involves  an  integral,  which  is  not  an  algebraic  operation. 

We  now  invoke  the  classic  theorem  of  Tarski's  on  the  solvability  of  real  algebra. 

Theorem  42:  Let  <f>  be  a  formula  in  the  language  of  real  algebra  with  k  free  variables  ac\  . .  .a*. 
Let  a  =<  ai  . .  .at  >  be  a  fc-tuple  of  values  for  the  a,-.  The  solution  space  of  <j>  —  that  is,  the  set 
of  values  of  a  for  which  <j>  is  true  —  is  a  semi-algebraic  region  in  $tk . 

Proof:  See  [Tarski,  51]. 

The  following  corollary  is  just  a  special  case  of  Tarski's  theorem. 

Corollary  43:  Let  Ri...R,  be  q  fixed  semi-algebraic  regions  in  3?".  Let  M\...Mq  be  q 
different  algebraic  parameterized  motions.  Let  a\  . .  .at  be  the  parameters  of  all  the  Mi.  Let  $  be 
an  algebraic  property  of  q  regions.  Thus  the  formula  $(MiRi  . . .  MqKq)  is  an  open  formula  whose 
free  variables  are  the  cti.  The  solution  set  of  this  open  formula  is  a  semi-algebraic  region  in  5R*. 

Proof:  Since  each  region  R^  and  each  motion  Mi  is  algebraic,  the  restricted  quantification  over 
the  range  <  X\  . .  .x„  >6  M(pn  . .  .p«i,R»)  can  be  expressed  as  a  quantification  over  the  x,,  subject 
to  an  algebraic  constraint  over  the  x;  and  pj.  Thus,  the  whole  condition  $  can  be  expressed  as  a 
formula  in  the  language  of  real  algebra  with  the  parameters  as  free  variables.  By  Tarski's  theorem, 
the  solution  space  of  this  open  formula  is  a  semi-algebraic  region. D 

Theorem  44:  As  in  corollary  42,  let  Ri  . . .  R,  be  q  semi-algebraic  regions  in  3?n;  let  M\  . .  .Mq 
be  q  different  algebraic  /t-parameter  motions;  let  ot\  . .  .at  be  the  combined  parameters  of  the  M<; 
and  let  $  be  an  algebraic  property  of  q  regions.  Furthermore  assume  that  each  of  the  k  parameters  is 
a  piecewise  analytic  function  of  time.  Then  the  fluent  "<I>  holds  on  Mi(i)(Ri) ..  .M,(<)(R,)"  varies 
discretely. 

Proof:  Consider  the  configuration  space  3?z  of  parameter  values.  By  corollary  41,  the  set  of 
values  satisfying  $  is  semi-algebraic.  Let  x(t)  be  the  "configuration  point"  of  the  tuple  of  all  the 
parameter  values  over  time.  Clearly,  since  each  parameter  is  semi-analytic,  so  is  £(<).  Applying 
theorem  17,  we  conclude  that  the  truth  of  4>  varies  discretely. D 

Example:  Let  0\  be  an  object  that  occupies  region  R\  at  time  t  =  0  and  that  is  rotating 
around  point  c~[  with  constant  angular  velocity  u\.  Define  O2,  R2,  c2,  u>2  be  likewise.  Assume  that 
Ri  and  R2  are  semi-algebraic.  Then  the  motion  of  0\  is  characterized  by  the  mapping  M\(t,x)  — 
c[  +  Ai(t)(x  —  c{)  where  Ai(t)  is  the  matrix 

cos(wi<)     -sin(wii) 
sin(u>i/)       cos(u>i<) 

.  The  motion  of  02  is  analogous.  We  now  define  the  four  parameters  Qi(t)  =  sin(wi<),  a2(0  = 
cos(wii),  a3(/)  =  sin(w2<).  Q4(0  =  cos(w2<).  Since  M  is  an  algebraic  mapping  over  the  Oj  and  x, 
we  can  apply  theorem  44,  and  infer  that  states  such  as  "Oi  overlaps  02"  vary  discretely.  Note  that, 
since  the  mapping  M  is  a  trigonometric  function  of  1,  we  cannot  apply  corollary  43  directly  to  the 
parameter  t. 


9      References 

Brooks,  R.  (1991)  "Intelligence  without  Reason."  Computers  and  Thought  Lecture,  Proc.  ICJAI-91. 

25 


A1(t)  = 


Davis,  E.  (1988)  "A  Logical  Framework  for  Solid  Object  Physics."  AI  in  Engineering,  vol.  3  no.  3, 
pp.  125-140. 

Davis,  E.  (1989a).  "Solutions  to  a  Paradox  of  Perception  with  Limited  Acuity,"  in  R.  Brachman, 
H.  Levesque,  and  R.  Reiter  (eds.)  First  International  Conference  on  Knowledge  Representation  and 
Reasoning,  Morgan  Kaufmann,  1989. 

Davis,  E.  (1989b)  "Order  of  Magnitude  Reasoning  in  Qualitative  Differential  Equations"  In  J.  de 
Kleer  and  D.  Weld  (eds.),  Readings  in  Qualitative  Physical  Reasoning,  Morgan  Kaufmann,  pp.  422- 
434. 

Davis,  E.  (1990)  "Physical  Idealization  as  Plausible  Inference."  Tech.  Rep.  534,  NYU  Comp.  Sci. 
Dept. 

Davis,  E.  (1991)  "The  Kinematics  of  Cutting  Solid  Objects."  Tech.  Rep.  541,  NYU  Comp.  Sci. 
Dept. 

Fleck,  M.  (1988)  "Boundaries  and  Topological  Algorithms."  M.I.T.  AI  Lab,  Tech.  Rep.  1065. 

Forbus,  K.  (1985)  "Qualitative  Process  Theory."  In  D.  Bobrow  (ed.)  Qualitative  Reasoning  about 
Physical  Systems.  MIT  Press. 

Galton,  Anthony  (ed.)  (1987)  Temporal  Logics  and  Their  Applications.  Academic  Press. 

Hamblin,C.L.  (1971)  "Instants  and  Intervals,"  Siudtum  Generate,  vol.  24,  pp.   127-134. 

Hayes,  P.  (1985)  "The  Second  Naive  Physics  Manifesto."  In  J.  Hobbs  and  R.  Moore  (eds.)  Formal 
Theories  of  the  Commonsense  World.  Ablex  Pubs. 

Kaufmann,  S.  (1991)  "A  Formal  Theory  of  Spatial  Reasoning."  in  J.  Allen,  R.  Fikes,  and  E. 
Sandewall  (eds.)  Proc.  Second  Intl.  Conf.  on  Princ.  of  Knowledge  Representation  and  Reasoning. 
Morgan  Kaufmann 

Knopp,  Konrad.  (1928)  Theory  and  Application  of  Infinite  Series.  Blackie  and  Sons,  Ltd.  London. 

Manna,  Z.  and  R.  Waldinger,  (1986).  "A  Theory  of  Plans,"  in  M.  Georgeff  and  A.  Lansky  (eds.) 
Reasoning  about  Actions  and  Plans.  Morgan  Kaufmann. 

McDermott,  D.  (1982)  "A  Temporal  Logic  for  Reasoning  about  Processes  and  Plans,"  Cognitive 
Science,  vol.  6,  pp.  101-155. 

Russell,  B.  (1919)  Introduction  to  Mathematical  Philosophy.  George  Allen  and  Unwin,  Ltd.,  London. 

Tarski,  A.  (1951)  A  Decision  Method  for  Elementary  Algebra  and  Geometry.  University  of  California 
Press. 


26 


NYU  COHPSCI  TR-599    c  ^ 
Davis,  Ernest 
Infinite  loops  in  finite 
time:  some  observations. 


-NYU  COMPSCI  TR-599 

Davis,  Ernest 
_  Infinite  loops  in  finite 
time:  some  observations. 


LIBRARY 

N.Y.U.  Courant  Institute  of 

Mathematical  Sciences 

251  Mercer  St. 
New  York,  N.  Y.    10012 

This  book  ».y  be  kep,  M/\R13    WQ? 

FOURTEEN    DAYS     WC 

A  fine  will  be  charged  for  each  day  the  book  is  kept  overtime. 


r — *m 

TSWt 

-i 

IM 

a 

<r 
0 

J 
>- 
< 
0 

OtSi 


8EJMB 
9&* 


Us  n 

IS  isi  ■•^s 


